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The ability to hear two sounds at once is one of the most useful 
re perties of the human ear. This ability—noted by G. S. Ohm more 
an a century ago—allows us to respond selectively to certain com- 
Ponents of the total acoustic pattern and to ignore others, to hear our 
‘end’s voice in spite of a background of noise, to follow the theme x 
and still hear the obbligato. But, remarkable as this ability is, it is w 
not infallible, and sometimes we are unable to hear one sound because è 
another sound interferes. The selective mechanism is studied in its N 
Simplest form when two puré tones are introduced into the ear and the > 
istener is asked to report the presence or absence of one or the other 
of the tones, In such experiments it is found that the ear is not a perfect N 
analyzer, for some tones obscure the perception of others. This inter- N 
erence is called auditory masking. 
uditory masking is usually defined as “the shift of the threshold 
of audibility of the masked sound due to the presence of the masking _ 
Sound” (23), Following this definition, the measurement of auditory 
Masking is a straightforward experimental procedure. First, the just- 
audible sound-pressure is determined in the quiet. Then the interfering 
Sound is introduced and the listener’s threshold is again determined. 
olds difference, in decibels, between the quiet and the masked thresh- 
ti S is the measure of the amount of masking produced by that par- 
goular type and intensity of interference for that particular masked 
ee The notion can be applied to complex noises like speech just 
Teadily as it is applied to pure tones. The introduction of an inter- 
cring noise makes it necessary to raise the intensity of the speech for 


3 . 
Much of the research reviewed here was begun under an OSRD contract and is con- 


ating under contract with the U. S. Navy, Office of Naval Research (Contract N5ori- 
» Report PNR-23). s 
dn of the research reviewed has been published previously but becausė of miia 
ence pease some of the publications are available to only a limited psychological audi- 
< In some cases it has been impossible even to refer to the original source. 
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it to be understood. The necessary increment is taken as the measure 
of the masking produced by the noise. 

Knowledge of the ear’s susceptibility to interference is obviously a 
matter of considerable practical value. Much of our present knowledge 
has grown out of the job of developing the telephone, although research 
on speech-communication problems in World War II has supplemented 
the earlier investigations. In the course of this research many different 
sounds have been studied to determine the interference they might 
produce, and a review of the results shows the masking of speech to 
depend on three characteristics of the masking sound: (1) its intensity 
relative to the intensity of the speech, (2) its acoustic spectrum, and (3) 
its temporal continuity. In the following pages we will review a variety 
of sounds which are, or might be, encountered, and in every case the 
disruption of vocal communication is determined by these three at- 
tributes. Human speech is most seriously masked by an uninterrupted 
noise which has its power concentrated in the lower third of a spectrum 
covering the frequency-range from 100 to 4000 or 5000 cycles. 


THE SPECTRUM OF SPEECH 


A noise which interferes with our perception of one sound may not 
interfere with our perception of another. Consequently, it is of some 
importance to examine the spectrum of human speech, since speech is 
the sound of direct concern for the present discussion. 

Physically, speech consists of vibrations varying widely and rapidly 
in their intensity and frequency (2, 4, 19). In general, the vowels can 
be analyzed into discrete component frequencies, with fundamentals in 
the neighborhood of 100-200 cycles per second. Some consonants, 
however, contain energy distributed almost continuously through the 
high frequencies. Thus the energy is constantly shifting from one range 
of iay to another as the talker proceeds from one sound to the 
next. 

One attempt to describe the energy in speech as a function of fre- 
quency borrows the concept of the spectrum from optics (2). A typical 
“long-interval speech spectrum” is shown in Fig. 1 (17). A crew of 
seven young men spoke an English sentence in a “conversational” 
voice, and a condenser microphone located 18 inches in front of the 
lips pickedf'up the sound-waves and converted them into voltages 
which were analyzed by an audio-spectrometer. At a distance of 18 
inches, the total pressure integrated over the entire range of frequencies 

he standard reference pressure of 0.0002 


was about 76 db above t 
dyne/cm*. The spectrum shown in Fig. 1 represents the root-mean- 
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Square pressure in frequency-bands one cycle wide. For purposes of 
Orientation the minimum audible field for pure tones is indicated at the 
bottom of the graph (18). This type of representation shows the long- 
time average distribution of speech-energy over the range of audible 
frequencies, but it is necessary to remember that the “instantaneous” 
Spectrum is constantly shifting* and is seldom, if ever, similar to the 
long-time average. 


RMS PRESSURE PER CYCLE 
10 09+0 0008 orng scut 


‘000 
FREQUENCY IN CYCLES PER SECOND 


Fi Fis. 1, Tue LONG-ĪNTERVAL SPECTRUM OF CONVERSATIONAL AMERICAN SPEECH 
OR SEVEN MALE Voices, EXPRESSED IN TERMS OF THE RMS PRESSURE IN FREQUENCY- 


ANDS ONE CYCLE WIDE. 
he over-all level of the speech 18 inches in front of the talker’s lips was 76 db re 


0002 dyne/emy, 


fant In terms of a long-time average, therefore, speech isa low-frequency 
Oise. Most of the power in the speech-wave is carried by frequencies 
celow 1000 cycles. Thus, even in ignorance of the ear’s susceptibility, 
hae nature of speech itself suggests that communication is most seri- 
‘end disrupted by sounds which have their energy concentrated in the 
quencies below 1000 cycles. 


ARTICULATION TESTING z 


When the over-all pressure of the speech-wave is less than about 5-10 
re 0.0002 dyne/cm?, the speech is inaudible. As the intensity of the 
ech is gradually increased the presence of spoken sounds can be de- 
.°Cted even though none of the words is distinguishable. This intensity 
Conveniently referred to as the threshold of detectability for speech (3). 
PProximately 8 db above the threshold of detectability, the sounds 
eee to be perceived as words. If the spoken material is continuous 
Scourse, it is possible, though difficult, to understand the gist of the 
R. K. Potter 


db 
Spe 


* 
as,“ recent attempt to include the temporal dimension is reported by 
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passage. This intensity is called the threshold of perceptibility. If the 
intensity is increased about 4 db more, the listener is able to obtain 
without perceptible effort the meaning of almost every sentence and 

; phrase of the connected discourse. Above this level, the threshold of 
intelligibility, the intensity of the speech can be increased until it 
-becomes painful at about 140 db (1). These thresholds are distinct and 
reliable, and different listeners will agree on their value. 

Any one of the three thresholds can be used to determine the shift 
in the threshold due to the presence of a masking sound. With listeners 
instructed to determine the threshold of perceptibility in the presence 
of noises of variable intensity, functions of the type shown in Fig. 2 
are obtained. Thus a pure tone of 1000 cycles at an intensity of 100 db 
shifts the threshold of perceptibility 18 db. A tone of 300 cycles at 
100 db shifts the threshold 42 db, and a random noise of 100 db pro- 
duces a 68-db shift. 


MASKING OF SPEECH 
BY NOISE 


MASKING BY 
300 CPS 


MASKING IN DECIBELS 
è 


MASKING BY 
1000 CPS 


o 20 40 69 eo 100 120 140 


INTENSITY OF MASKING SOUNO (0B RE 0.0002 DYNE/CM") 


Fic. 2. THE SHIFT IN THE THRESHOLD OF PERCEPTIBILITY FOR SPEECH AS A FUNC- 
TION OF THE INTENSITY OF DIFFERENT MASKING SOUNDS. 


Random noise and pure tones having frequencies of 300 and 1000 cycles were used 
to mask the speech, 


For a precise determination of the threshold, however, more elab- 
orate methods are available. A talker reads out a list of discrete words, 
and listeners record what they hear. The percentage of the words heard 
correctly is taken as the articulation score. This method stands in the 
same relation to the threshold-methods as does the method of constant 

` stimuli to the method of average error in classical psychophysics. By 
changing the intensity of either the speech or the masking sound 4 
series of articulation scores ranging from 0% (no words heard) to 100% 
(all words heard correctly) is obtained. The 50% point on this function 
can be regarded as the threshold. 


It is obvious that the results of these tests depend upon the type 
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of spoken material, the proficiency of talker and listeners, the charac- 
teristics of the equipment, etc. During the course of the development'of 
articulation testing methods, from their first use by the Bell Telephone 
Laboratories (4) to their present wide application in communication 
problems, many variations have developed and considerable work has 
been devoted to their evaluation and standardization (3). The method 
adopted in the research reviewed here is recommended by its economy 
of time and personnel. Talkers are replaced by their recorded voices. 
Lists of difficult words are phonographically recorded in several differ- 
ent scramblings by each of five talkers. Between two and four trained 
* listeners are used. As the records are played, the listeners follow the 
words with the aid of a check-list, uncovering each word after they hear 
it spoken. The listeners indicate by check-marks or manual counters 
whether or not they hear the word correctly. Since each listener must 
establish his own criterion for “hearing” a word, training is necessary 
efore consistent results are obtained. A comparison with results from 
More formal articulation tests, however, indicates that with trained 
listeners the abbreviated procedure gives valid articulation scores. 
It will be noted that the masking produced by various noises can 
compared without obtaining the complete masking functions il- 
lustrated in Fig. 2. Single points on each function, if properly chosen, 
Indicate the relative disruption of vocal communication which the 
ifferent noises produce. Thus, if a quick check is desired, the threshold 
of Perceptibility or intelligibility can be determined. If a more accurate 
Value is needed, complete articulation functions can be run. One meth- 


od is useful when a large number of variables need to be surveyed; the 


Other gives more precise information when the significant variables 


ave already been determined. 


if 
MASKING SPEECH BY TONES 


Sounds which occur around us every day are customarily classified 
aS either tones or noises. In this usage the word “tone” implies a har- 
Monic relation between component frequencies. The word “noise” 
indicates that the component frequencies are dissonant, or randomly 
related, This is a classification of convenience, and a rigorous attempt 
to apply it to all the sounds which can be produced would quickly 
reveal borderline cases. Human speech, for example, is composed of 
Sounds which have tonal characteristics (e.g. the vowels) and sounds 
With noisy characteristics (e.g. unvoiced plosives and fricatives), as 

cll as many sounds which have both tonal and noisy components. 
R order to classify the variety of sounds which. might interfere with 
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speech, at least three categories are required: (1) tones, (2) noises, and 
(3) voices. This rather arbitrary classification of the various sounds 
will serve to organize the following discussion. 

Pure tones. As we would expect from the characteristics of speech, 
and as was indicated in Fig. 2, tones of low frequency produce more 
interference with speech than do tones of high frequency. This fact is 
supported by a very thorough study reported by Stevens, Miller and 
Truscott (21). These investigators found that for weak intensities the 
maximal masking is produced by sine waves in the vicinity of 500 
cycles, whereas at high intensities the greatest masking occurs near 
300 cycles. This effect is presumably due to the rapid spread of masking 
into the higher frequencies as the intensity of the masking tone is in- 
creased (22). This upward spread is aided by distortion in the ear which 
produces aural harmonics at high sound-levels (4, 20), and also by the 
characteristics of the response of the individual auditory-nerve fibers 
(9, 10, 13). 

Complex Tones. Stevens, Miller and Truscott also determined the 
masking produced by square waves and by repetitive pulses of 10- 
microsecond duration. Square waves are less critical than sine waves 
as to frequency; fundamental frequencies between about 80 and 400 
cycles mask with approximately equal effectiveness. No shift in the 
optimal frequency was observed as the intensity of the complex tones 
was increased. If the peak amplitude of the pulses is held constant, the 
greatest interference is produced by a pulse-repetition-frequency of 
about 200 pulses per second. 

When the various wave-forms are equated for sound-pressure level, 
the sine wave is about 7 db less effective than the square wave, and the 
square wave about 7 db less effective than the pulses. This comparison 
is made for the low-frequency range in which the tones are most ef- 
fective. At frequencies above about 1000 cycles all the tones are equally 
ineffective. 

Before proceeding to discuss other types of interference we should 
pause to note two important generalizations supported by these data. 
First, low frequencies are more effective than high, and second, the 
greater the harmonic content of the tone the better its spectrum blan- 
kets the frequency-range of speech. 

Patterns of Tones. Essentially the same considerations apply to the 
case of tones which are changing in frequency. Tests have been con- 
ducted with signals which changed abruptly or gradually in frequency, 
which covered wide or limited frequency-ranges at slow, intermediate 
or rapid rates, which varied in complexity or were accompanied by 


- 
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steady tones, etc. In all the variations of tonal signals, the significant 
dimensions for masking were the frequency of the fundamental and the 
richness of the harmonic content. 

Some examples should document these statements. One series of ex- 
periments was conducted with warbling tones—the tone rose slowly 
from the lowest to the highest frequency, then dropped suddenly back to 
the lowest frequency and rose again. The tone was produced by a relax- 
ation oscillator and was rich in harmonics. Such a warble can be varied 
in center-frequency, range or rate. With the speech (heard binaurally 
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INTENSITY OF MASKING SOUND (0B RE 0.0002 DYNE/CM®) 


Fic. 3. Tue PERCENT OF THE Worps WHICH WERE CORRECTLY HEARD AS A Func- 
TION OF THE INTENSITY OF VARIOUS TONAL MASKING SOUNDS. 

Different center-frequencies of a warbling sound are compared in A; different ranges 
in B, different rates in C. In D the effect of adding a steady tone to a stepped pattern 
Of tones is illustrated. The level of the speech was held constant at 95 db. 


in dynamic earphones PDR-3) held at 95 db, the intensity of the warble 


Was raised in 6-db steps until the functions shown in Fig. 3 had been 
obtained. Fig. 3-A compares warbles which had different center- 
frequencies. The high-pitched warble (550 to 750 cycles) resulted in 
the least interference and 50% of the words were still intelligible when 
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speech, at least three categories are required: (1) tones, (2) noises, and 
(3) voices. This rather arbitrary classification of the various sounds 
will serve to-organize the following discussion. 

Pure tones. As we would expect from the characteristics of speech, 
and as was indicated in Fig. 2, tones of low frequency produce more 
interference with speech than do tones of high frequency. This fact is 
supported by a very thorough study reported by Stevens, Miller and 
Truscott (21). These investigators found that for weak intensities the 
maximal masking is produced by sine waves in the vicinity of 500 
cycles, whereas at high intensities the greatest masking occurs near 
300 cycles. This effect is presumably due to the rapid spread of masking 
into the higher frequencies as the intensity of the masking tone is in- 
creased (22). This upward spread is aided by distortion in the ear which 
produces aural harmonics at high sound-levels (4, 20), and also by the 
characteristics of the response of the individual auditory-nerve fibers 
(9, 10, 13). 

Complex Tones. Stevens, Miller and Truscott also determined the 
masking produced by square waves and by repetitive pulses of 10- 
microsecond duration. Square waves are less critical than sine waves 
as to frequency; fundamental frequencies between about 80 and 400 
cycles mask with approximately equal effectiveness. No shift in the 
optimal frequency was observed as the intensity of the complex tones 
was increased. If the peak amplitude of the pulses is held constant, the 
greatest interference is produced by a pulse-repetition-frequency of 
about 200 pulses per second. 

When the various wave-forms are equated for sound-pressure level, 
the sine wave is about 7 db less effective than the square wave, and the 
square wave about 7 db less effective than the pulses. This comparison 
is made for the low-frequency range in which the tones are most ef- 
fective. At frequencies above about 1000 cycles all the tones are equally 
ineffective. 

Before proceeding to discuss other types of interference we should 
pause to note two important generalizations supported by these data. 
First, low frequencies are more effective than high, and second, the 


greater the harmonic content of the tone the better its spectrum blan- 
kets the frequency-range of speech. 


Patterns of Tones. Essentially the same considerations apply to the 


case of tones which are changing in frequency. Tests have been con- 


ducted with signals which changed abruptly or gradually in frequency; 


which covered wide or limited frequency-ranges at slow, intermediate 


or rapid rates, which varied in complexity or were accompanied by 
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steady tones, etc. In all the variations of tonal signals, the significant 
dimensions for masking were the frequency of the fundamental and the 
richness of the harmonic content. 

Some examples should document these statements. One series of ex- 
periments was conducted with warbling tones—the tone rose slowly 
from the lowest to the highest frequency, then dropped suddenly back to 
the lowest frequency and rose again. The tone was produced by a relax- 
ation oscillator and was rich in harmonics. Such a warble can be varied 
1n center-frequency, range or rate. With the speech (heard binaurally 
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Different center-frequencies of a warbling sound are compared in A; different ranges 
of te, different rates in C. In D the effect of adding a steady tone to a stepped pattern 
nes is illustrated, The level of the speech was held constant at 95 db. 


rble 


in raised in 6-db steps until the functions shown in Fig. 3 had been 
f tained. Fig. 3-A compares warbles which had different center- 
Tequencies. The high-pitched warble (550 to 750 cycles) resulted in 

€ least interference and 50% of the words were still intelligible when 
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the warble was 20 db more intense than the speech. The low-pitched 
warble (170 to 220 cycles) interfered most, and 50% of the words were 
unintelligible when this sound was only 7 db more intense than the 
speech. 

In Fig. 3-B three warbles are compared which had the same rate 
(1.5/sec.) and the same center-frequency (450 cycles), but which dif- 
fered in the range of frequencies covered by the warbling tone. The 
widest range (50 to 850 cycles) was slightly less effective because it 
included some relatively high frequencies, but the differences are quite 
small. 

Different rates of warble are compared in Fig. 3-C, and no signifi- 
cant change in masking results from varying the rate of modulation 
between 3 and 15 warbles per second. 

Fig. 3-D presents results for a slightly different type of tonal inter- 
ference. In this case the complex tones varied in frequency by abrupt 
steps, and an irregular pattern of 5 tones ranging in fundamental fre- 
quency between 300 and 600 cycles was repeated over and over. The 
masking produced by this signal is indicated by the right-hand f unction 
in Fig. 3-D. A steady, complex tone of 200 cycles was then added to 
the stepped tones, and the left-hand function resulted. The improved 
coverage of the component frequencies of speech by this more complex 
oe increased the masking by about 10 db (for 50% word articula- 
tion). 

These results, which are typical samples drawn from a large body of 
similar data, show that the important aspects of these sounds are the 
aspects related to the adequate coverage of the speech-spectrum. 
Variations in the characteristics of the masking sound which do not 
affect this coverage do not affect the masking. ‘ 

Music. Since much of the popular dance music of the day is (to 
some people) noisy and annoying, the possibility that it interferes 
seriously with speech was worth investigating. Listening tests quickly 
revealed, however, that most music is inoffensive for two reasons. Music 
is divided into phrases, and between phrases are pauses, and during 
pauses speech is intelligible. And in the second place, clarinet and 


trumpet solos usually fall above the range of frequencies which pro- 
duces the most efficient masking of speech. 


A very complex masking sound is obtained, however, if two or three - 


phonographic recordings are played at the same time. The different 
orchestras fill in each other’s pauses, and the coverage of speech-fre- 
quencies is more consistently adequate. With a signal of this type, a” 
articulation function was obtained which was almost identical with the 
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function in Fig. 3-D for the complex stepped tones. Presumably, this 
function represents nearly the maximum masking which a tonal signal 
can produce. If the sound is made more complex it loses its tonal char- 
acteristics and begins to sound more and more like an irregular noise. 


MASKING By NOISE 


, In many ways white noise, having a uniform, continuous spectrum, 
is the ideal sound to use in studying the masking of speech. This noise 
Provides a continuous coverage of a wide range of frequencies, and the 
Spectrum can be manipulated by filters to suit the manipulator’s fancy. 
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Fic. 4, THE ARTICULATION SCORE AS A FUNCTION OF THE INTENSITY OF THE MASKING 


OISE, 
h ae different frequency-bands of noise are parameters. The level of the speeck was 


eld constant at 95 db. 


Thus we might use wide or narrow bands of noise, or noises “shaped” 

Oor maximal effect. : 
Narrow Bands of Noise. Let us review first some results obtained 
+ With narrow bands of noise. Highly discriminating filters (M-derived) 
Were inserted into the noise-channel to provide adjacent pass-bands as 
fa follows: 135-400 cycles, 350-700 cycles, 600-1100 cycles, 900-1500 
i Cycles, 1300-1900 cycles, 1800-2500 cycles, 2400-3120 cycles, and 3000- 
4000 cycles. These cut-off frequencies represent the frequency at which 


E 
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the response of the system was 6 db below the maximum in the band. 
The narrow bands of noise were then mixed with the speech, which was 
held constant at 95 db, and articulation tests were run. The results are 
shown in Figs. 4 and 5. In Fig. 4 the articulation score is plotted as a 
function of the noise-level, with the different bands as parameters. 
In Fig. 5 the articulation score is plotted as a function of the center- 
frequency of the bands of noise (vertical divisions indicate the approxi- 
mate cut-off frequencies of the bands), with the noise-level as the 
parameter. 


PER CENT WORD ARTICULATION 


FREQUENCY IN CYCLES PER SECOND 
Fic. 5. SMOOTHED Curves Drawn FROM THE DATA oF FIG. 4, SHOWING THE ARTICU- 
LATION SCORE AS A FUNCTION OF THE COMPONENT FREQUENCIES OF THE MASKING NOISE 
FoR DIFFERENT NOISE-LEVELS. 


Complete masking of the speech was obtained only with the two 
lowest bands. With bands of noise above 1000 cycles the speech could 
be heard even when the noise was more than 18 db above the level of 
the speech. Results with a wide band of noise (20 to 4000 cycles) are 
also shown in Fig. 4, and it will be noted that none of the narrow bands 
of noise produced more masking than the unfiltered noise. At the low 
noise-levels, the high-frequency bands were more effective than bands 
below 1000 cycles. At the high noise-levels, however, the low-frequency 
bands were more effective. This is seen most clearly in Fig. 5. 

In order to interpret these results it is necessary to recall the results 
of studies on the masking of pure tones by pure tones. Sounds of low 
frequency will, if intense enough, eventually mask the entire range of 
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frequencies involved in speech, but the high-frequency sounds are 
not able to mask the low frequencies of speech. The fact that, at low 
noise-levels, the high-frequency bands of noise are more effective than 
the low is probably due to the distribution of energy in speech. The 
higher frequencies of speech (cf. Fig. 1) are much weaker and are 
therefore more easily masked. As the intensity of the high-frequency 
noise is raised, however, the high-frequency sounds of speech (mostly 
unvoiced consonants) are masked, but this masking does not spread to 
the low-frequency sounds. If, on the other hand, a band of noise com- 
posed of low frequencies is gradually increased in intensity, it will at 
first have little effect because it is not so strong as the low speech-fre- 
quencies, nor is it intense enough to spread its masking effects to the 
higher, weaker speech-sounds. As the intensity is further increased, 
however, the masking effects begin to cover the entire frequency-range 
of speech, and complete masking is rapidly obtained. 

It follows from these results that the most masking is produced by 
a spectrum with the noise-energy in the lower third of the range from 
100 to 4000 cycles. Experimentation with a wide variety of “tailored,” 
sloping noise-spectra showed that the optimal masking noise has a 
spectrum similar to the long-interval speech-spectrum which is being 
masked. 

Modulation of the Noise. Random noise is often described as a steady 
“hishing” sound which, at low levels, is not unpleasant to hear. Unlike 
intermittent bursts of static or the irregular clacking of a typewriter, 
the sustained nature of random noise enables the listener to adapt to 
it and, to a certain extent, to ignore it. Consequently, different meth- 
ods were tried for changing or modulating the quality of the noise. 
One method used filters which were switched in and out irregularly. 
Another made use of the Sonovox, a device which can be substituted 
for the vocal cords as a sound-source for speech. With a Sonovox 
driven by a random noise voltage, the resulting ‘‘speech”’ resembles a 
hoarse breathy whisper. : 

The results with such noises need not be reviewed in detail. It is 
sufficient to say that the experimenters were again forced to the con- 
clusion that the critical dimension of the masking noise is its spectrum. 
Modulations of the noise may affect the listener’s comfort, for he will 
Say he prefers to listen to the steady noise. But when he is driven to it, 
he will understand just as many words with a modulated masking noise 
as with a steady noise. 

The Masking Wave-Form. Will two noises having identical spectra, 
but different wave-forms, produce the same masking effects? 
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In order to study this question a frequency-modulated (FM) audio 
oscillator was developed. This oscillator was modulated with noise, 


i.e., the frequency of oscillation was randomly varied. The resulting , 


wave-form, shown schematically in the upper half of Fig. 6, resembles 
a sine wave of constant amplitude, but of varying frequency. Once the 
spectrum of the frequency-modulated noise had been measured, it was 
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Fic. 6. FREQUENCY-MoDULATED (FM) Noise vs. RaAnpoM NOISE. 
The wave-form at the top of the figure schematizes FM noise; the wave-form below, 
random noise. The matched spectra of FM and random noise are shown in the lower half 


of the figure. 


then possible by the use of filters to “shape” the spectrum of a random 
noise of irregular amplitude in such a way as to make it correspond with 
the spectrum of the FM noise. When the two spectra shown in the 
lower half of Fig. 6 are compared, it is seen that the spectrum of the 
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Fic, 7. THe SHIFT IN THE THRESHOLD OF AvupīBILITY Propucep BY Two NOoIsESs 
Havine DIFFERENT WAVE-FORM BUT SIMILAR SPECTRA. 
(Cf. Fig. 6.) 


FM noise is 2 or 3 db lower at frequencies above 2000 cycles. Other- 


wise the spectra are very similar. 
The masking by these two wave-forms was then tested for both 


pure tones and speech, and the results are shown in Figs. 7 and 8. In 
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Fic. 8. THE ARTICULATION SCORE AS A FUNCTION OF THE INTENSITY OF Two Mask- 
ING Noises HAVING DIFFERENT Wave-Forms But SIMILAR SPECTRA. 
(CE. Fig. 6.) The level of the speech was held constant at 95 db. 
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Fig. 7 the masked thresholds are plotted against the frequencies of the 
masked tones, with the intensity of the noise as the parameter. The 
small discrepancy between the masking effectiveness of the two noises 
correlates with the fact that the FM noise contained slightly less energy 
at the higher frequencies. In Fig. 8 the articulation score is plotted 
against noise level with wave-form as the parameter. The two functions 
are not significantly different. 

In the light of these results it is apparent that, within certain broad 
limits, the masking produced by a sound depends upon the spectrum 
of the sound and is independent of the phase-relations among the com- 
ponent frequencies. This conclusion was verified in another set of 
tests which compared random noise with the same noise after it had 
passed through a circuit (a "peak clipper”) which sharply limited the 
instantaneous peak amplitudes of the noise voltage. Peak-clipped noise 
resembles a square wave randomly modulated in frequency, but its 
spectrum and masking effectiveness are the same as that of unclipped 
noise, provided the intensities are the same. Within broad limits, there- 
fore, the actual form of the masking wave as seen on a cathode-ray 
oscilloscope is of no consequence in determining masking. This agrees 
with the general hypothesis that the ear is relatively insensitive to 
phase-relations. The possible exceptions arise when the masking spec- 
trum is produced by very intense but very intermittent bursts of no!se- 
With such a sound the recovery-time of the ear becomes of appreciable 
importance, and we will have to consider specifically these temporal 
factors a few pages hence. 


MaAsxkinG By OTHER VOICES 


It has been said that the best place to hide a leaf is in the forest, anid 
presumably the best place to hide a voice is among other voices. ‘ A 
tainly it is a practical problem, for the babble of other voices 1S ê 
frequent background for our spoken words, and no survey of common 
masking sounds should overlook the noises which we ourselves produce. 
In considering “other” voices as sounds masking “the” voice, two COP- 
siderations are obviously crucial. How many other voices is the speaker 
competing with? And what are the other people saying? : 

It is relatively easy for a listener to distinguish between two voices; 
but as the number of rival voices is increased the desired speech is lost 
in the general jabber. This fact is demonstrated in the articulation 
functions of Fig. 9. The groups of interfering talkers were composed of 
an equal number of men and women (the single masking voice was @ 
man’s) reading and talking in a conversational tone. The “desired” 
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voices were, in all cases, male voices. For convenience in testing, the 
babble of voices was phonographically recorded by a high-fidelity 
recording system. 

Note that the single voice is a relatively poor masking signal, and 
that even two voices are less effective than four or more. Although the 
long-interval spectrum of a single voice is nearly optimal for masking 
speech, the spectrum at any moment does not include all of the neces- 
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Fic. 9, THE ARTICULATION SCORE AS A FUNCTION OF THE INTENSITY OF DIFFERENT 


Numprrs or MASKING VOICES. 
The level of the desired speech was held constant at 95 db. 


Sary frequencies. The variations in the level of a single voice are great, 
and there are relatively long intervals during which no masking sound 
is present. With several voices, however, a continuous masking signal 
1s produced. 

The content of the masking speech is a more difficult factor to 
evaluate. Conversational voices were compared with loud, excited 
voices liberally interspersed with laughter, cheering and improbable 
vocal effects. The two sounds could be likened to the chatter at a 
friendly dinner-party versus the din of a particularly riotous New Year's 
Eve celebration. There was little difference in masking, however. 
The shouting voices were a little more effective at low noise-levels, but 
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this was correlated with more high-frequency energy in the spectrum of 
the shouted babble. 

Conversational babblings in different languages were also compared. 
A language was chosen which the listeners did not know, but the 
masking was neither greater nor less than was obtained with an English 
babble. Once again, it is necessary to conclude that the crucial factor 
is the masking spectrum. The particular way in which the spectrum 
is produced is of secondary importance. 


100 


80 


60 


40 


20 


PER CENT WORD ARTICULATION 


o 20 40 60 80 100 


PER CENT OF TIME SPEECH WAS ON 
TIME ON 
( TOTAL TIME * 100) 


Fic, 10. THE ARTICULATION SCORE As A FUNCTION OF THE PERCENT OF THE TIME 


THE SPEECH WAS ON. 
The level of the speech was held constant at 95 db, and the speech-wave was inter- 
rupted ata rate of nine times per second. 


TEMPORAL CONTINUITY OF MASKING SOUND 


Not all of the sounds which force us to raise our voices are continu- 
ous: They come and go, change in quality and loudness. Consequently, 
some attention should be paid to this dimension of the masking sound. 
Because the ear is facile in patching together interrupted fragments of 
speech, an intermittent noise is not as serious a hazard as a continuous 
noise. 

In order to demonstrate that large portions of the speech can be 
completely blanked out without seriously lowering intelligibility, an 
electronic switch was used to interrupt the speech nine times a second 
for variable portions of the total time. The function in Fig. 10 shows 
the relation between the intelligibility of words and the percentage of 
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the time the speech was present. With the speech on only half the time, 
80% of the words were still correctly heard. It was not until 90% of 
the speech wave was missing that none of the words could be under- 
Stood. (It should be noted, of course, that the function of Fig. 10 
depends upon the rate of interruption and the type of test material, as 
well as upon the on-off ratio.) Qualitatively, the interrupted speech 
seems “hoarse” or “husky,” as if the talker had some disorder of 
phonation. Nonetheless, the speech is surprisingly intelligible. 
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Fic. 11, THE ARTICULATION SCORE As A FUNCTION OF THE INTENSITY OF INTER- 


RUPTED-Masxine NOISES. 
The percent of the time the noise was on as parameter. The level of the speech was 


held constant at 95 db. 


These results suggest that with an interrupted masking sound the 
ĉar should be able to integrate and interpret those portions of the 
SPeech available when the masking sound is not present. Consequently, 

€ electronic switch was used to interrupt a masking noise instead of 
the Speech. The results are shown in Fig. 11, where the articulation 
Score is plotted against the intensity of the masking noise, with the 
Percentage of the time the noise was on as the parameter. The noise- 
€vel in this case is expressed in terms of the intensity of the noise 
When it is on, and no average is taken between the level of the noise 
when it is on and the level when it is off. 
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When the noise is present less than 50% of the time, practically 
no masking is produced. Noise present 80% of the time is considerably 
less effective than uninterrupted noise. Apparently the recovery of 
the ear is rapid enough, and our ability to integrate fragments of 
speech is great enough, that any periodic interruption of a masking 
sound lowers its masking effectiveness. The careful documentation of 
this conclusion, however, remains for future investigators. 


SPEECH-TO-NOISE RATIO 


Articulation testing is a tedious and expensive business. The di- 
rect determination of the masking produced by a particular noise in 
some practical situation is not always feasible. Consequently consid- 
erable attention has been devoted, especially at the Bell Telephone 
Laboratories (8, 14), to the development of a computational procedure 
for predicting articulation scores. The simplest of the various pro- 
posed procedures relate the intelligibility of the speech sounds to the 
speech-to-noise ratios in the different frequency-ranges of speech. 

The speech-to-noise ratio, as the words imply, is the ratio in decibels 
of the speech-intensity to the noise-intensity. It is usually convenient 
to distinguish two different but related ratios: (1) the over-all speech- 
to-noise ratio, and (2) the speech-to-noise ratio per cycle. To deter- 
mine the over-all ratio the rms levels of speech and noise are measured 
directly and expressed as a ratio. Thus, for example, with speech at 95 
db and a masking sound at 90 db, the over-all speech-to-noise ratio 15 
+5 db. ia 

We have seen, however, that masking depends upon the distribution 
of the component energy over the frequency-range. Two noises with 
the same over-all level can produce quite different masking results de- 
pending upon the spectra of the noises. To estimate masking effective- 
ness, therefore, it is necessary to know the spectrum of the masking 
nojse and the spectrum of the speech. With an audio-spectrometer the 
two spectra are determined. The ratio in decibels of the speech-level- 
per-cycle to the noise-level-per-cycle at any frequency is the speech-to- 
noise ratio for that frequency. Thus the speech-to-noise ratio per cycle 
varies with frequency, while the over-all ratio does not consider the 
spectral distribution of the two sounds. 

Now in order to use speech-to-noise ratios in the computation of ar- 
ticulation scores, it is necessary to make three assumptions. These can 
be considered in order. 


1. Articulation scores depend upon the type of test-material used. With 
fixed speech-to-noise ratio, higher scores will be obtained with sentences 


any 
than 
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with words, and the lowest scores will be obtained with meaningless syllables. 

It is desirable, therefore, to obtain some more fundamental measure independ- 
ent of the particular test-material. Sentence, word, or syllable scores could then 
be related to this more fundamental index, and all three could be predicted di- 
rectly once the fundamental measure had been computed. Such a fundamental 
measure has been proposed by the Bell Telephone Laboratories. They call it 
the articulation index, and its derivation leads us to the second assumption. 

2. The contribution to the articulation index made by any narrow band of speech 
frequencies is independent of the contribution made by other bands. If the range of 
Speech-frequencies could be divided into a number of narrow bands each con- 
tributing the same amount to intelligibility, and each contribution independent 
of all other contributions, the sum of the individual contributions could be taken 
as the articulation index. 

_ The Bell Telephone Laboratories attempted to establish empirically the rela- 
tion between frequency and intelligibility by articulation tests conducted with 
filtered speech. With male voices, for example, they found that frequencies of 
Speech above 1660 cycles gave the same articulation score (68% with the test 
material they were using) as the frequencies below 1660 cycles. For that crew 
and those test-materials, therefore, an articulation score of 68% was equivalent 
to an articulation index of 0.50, and 1660 cycles divided the speech into two 

ands, one above and one below, which contributed equally, 0.50 apiece, to the 
total index. Ina similar manner the range of frequencies was further subdivided 
Until a complete function could be drawn relating the articulation index to fre- 
quency. Once this function was determined, the total range of speech frequen- 
Cles was divided into a convenient number of bands, each contributing equally 
to the articulation index (19). When the intensity of the speech in all the bands 
1s far enough above threshold, the total contribution of all bands gives an ar- 
ticulation index of 1.00. Just how far above threshold the speech in any band 
Should be brings us to the third assumption. ë 

3. The contribution to the articulation index made by any narrow band of 
Sbeech-frequencies depends upon the speech-to-noise-ratio in that band. If the 
Speech in a given band of frequencies falls far below the level of the noise in 
that band, the speech will make no contribution to the articulation index. It 
Is necessary, therefore, to use a weighting value to express the fractional part 
Ot the maximum contribution the band can make. This weight depends di- 
rectly upon the speech-to-noise ratio. When the speech is 30 db or more above 

€ noise, the band makes its maximum contribution, and speech-to-noise val- 


Ues greater than 30 db are considered as equivalent to the maximum weighting. 


The computational procedure evolving from this argument is 
Straightforward. First we determine the spectra of the speech and of the 
Noise. Then we divide the range of speech-frequencies into n bands. 
The weighting value for each band of frequencies is next computed 
Tom the speech-to-noise ratios in the bands. The maximum contribu- 
tion of every band is multiplied by its weighting factor. And then the 
Contributions of all the bands are totalled to give the articulation index. 

Jnally, the index is converted into sentence, word, or syllable articula- 
10n scores as desired. 
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Obviously this description of the procedure is highly simplified, and 
many special cases arise. For example, when narrow bands of noise 
are used to mask speech, the procedure does not provide adequate esti- 
mates of the masking effectiveness of the low-frequency bands of noise. 
Suppose that we have divided the speech spectrum into 20 equivalent 
bands. According to the computational procedure outlined above, the 
narrow band of noise will mask out 4 or 5 of the 20 bands, but the 
remaining 15 or 16 bands will continue to contribute to the articulation 
index regardless of the noise-level in the masked bands. Articulation 
scores should, according to prediction, fall a few percentage points and 
then remain constant for higher and higher noise-levels. Actually, as 
Fig. 4 shows, this is the case for high-frequency bands of noise. For 
low-frequency bands, however, there is a shift of masking into the fre- 
quencies above the band, and this spread increases as the intensity 
is increased. A similar example could, of course, be drawn from the 
results obtained when pure tones are used to mask speech. 

This particular difficulty can, perhaps, be encompassed in a compu- 
tational procedure if the masked threshold is used instead of the noise- 
spectrum. But in order to restrict the process entirely to pencil an 
paper, it is necessary to be able to predict the masked threshold for 
tones from the spectrum of the noise. This prediction can be made 
satisfactorily for noises with continuous spectra (5, 6, 7), but the pre- 
diction for tones or narrow bands of noise cannot be made at present. 

Another problem which was encountered in evaluating masking 
noises was the matter of temporal continuity. The procedure outline 
does not consider interrupted noises, even though interruptions ay 
crucial to masking effectiveness. If more were known of the build-up 
and recovery times of the ear, this dimension of masking could pre- 
sumably be quantified in a satisfactory way. For the present, however; 
the temporal aspect of the masking process has been neglected. 

. In spite of these difficulties, a simple computational procedure has 
much to offer. As a first approximation its usefulness cannot be denied. 
The emphasis is placed upon the distribution of masking energy 
relative to the distribution of the energy in speech and a survey o. 
masking noises confirms the belief that these spectral distributions are 
of central significance. Problems of wave-form, of patterning an 
modulation, of familiarity with the masking sound are ignored, a”! the 
results presented here indicate that these aspects of the masking sou? 
are of negligible consequence. 

The results of the research and of the discussion can therefore se 
briefly summarized: the greatest interference with vocal communication 


THE MASKING OF SPEECH 125 


is produced by an uninterrupted noise which provides a relatively con- 
Stant speech-to-noise ratio over the entire range of frequencies involved 
in human speech. Unfortunately, most of the noises we compete with 
fill this general prescription. 


THE QUESTION OF ANNOYANCE 


It is all well and good to know about the masking effectiveness of 
unwanted sounds, but few of us spend much time in boiler-rooms, air- 
Plane-cockpits, or machine-shops, and when we do ride the subway we 
Simply shut up until the car stops. Noises which we cannot outshout 
May be a hazard in some occupations, but most of us manage to get 
along fairly well in the average din of the average day. What really 
Upsets our vocal communication is the sound that, although it does not 
shatter our eardrums, is a nuisance, that distracts or annoys us, or just 

Sets on our nerves.” Every teacher knows the shudder that runs 
through a class when the chalk squeaks piercingly against the black- 
board, and an otherwise amusing radio program can become an agoniz- 
ing nuisance if we try to use the telephone. 

When we take this problem into the laboratory, however, it seems 
to disappear right in front of our ears. The major difficulty rests in the 
fact that the listener’s attitude is so important. If he is engaged in dif- ` 
ficult mental work, it may be relatively easy to annoy him. But if he 
istens with a defiant attitude, any attempts, to upset him with strange 
Noises may prove more amusing than effective. And since most of the 
Sounds we can use in the laboratory are out of context and relatively 
Meaningless, the task of being successfully obnoxious is practically im- 
Possible. Annoyance depends primarily upon the particular listener and 
the particular situation in which he finds himself. 

If, however, we are content to ignore some of the situational vari- 
ables involved, it is possible to ask listeners to compare different sounds 
On the basis of their “annoyance value.” Some simple listening situa- 
pon is standardized and the listeners compare pairs, use a rating scale, 
Ki rank-order an array of sounds. One can then evaluate the variables 

Ontributing to annoyance value as defined by the situation, although 
ti Safety with which the results can be extended to other situations is 

Pen to question. 

Listeners were presented with pairs of sounds and were instructed 

9. indicate which of the two sounds was more annoying. In making this 

€cision, the listeners were told to judge which sound of the pair would 
Tyee unendurable if they had to listen to it for a long period of time. 
ese instructions, therefore, constitute the definition of annoyance. 
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Fairly consistent results were obtained with groups of 10 to 20 listeners, 
and the scale of annoyance constructed in this way agreed closely with 
results obtained with rating or rank-ordering procedures. 

As an illustration, the results obtained with stepped patterns of 
tones will be considered. Eight different variables in the tonal pattern 
were studied for their effect upon annoyance-value. 


1. The higher the pitch of the component tones, the greater the annoyance- 
value. The range of frequencies tested was from 200 to 1500 cycles. 

2. A wide range of frequencies between the highest and lowest steps is more 
annoying than a restricted range. Listeners reported that the wide range 0 
component frequencies tended to be perceived alternately, first as a complete 
pattern and then as two patterns, one of high and one of low pitch. This effect 
is very similar to figure-ground reversals in visual perception. 

3. The addition of continuous tones to the stepped pattern of tones pro 
duces complex effects dependent upon the frequency-relation between the tones 
Beats give the sound a rough pulsing irregularity which the listeners disliked- 

4, Listeners asked to compare continuous sounds of different wave-shapes 
found the complex sounds, especially brief pulses, more annoying. In genera’ 
the sine wave was found to produce little annoyance. if 

5. Patterns of 3, 4, 6 and 12 tones were compared, but the number of di 
ferent steps in the complete pattern had little effect on the judgments of ane 
noyance, 

6. If one of the steps of a pattern is slightly longer in duration than th 
ers, a rhythmic quality is added which the listeners judged to be more ann n 
than tones of equal duration. Even more annoying, however, is the pattern i 
which all the tonal durations are randomly varying. 

7. A slow rate of repetition for a pattern of tones is considered slightly ™ 
annoying than a rapid rate. É 

8. Up to a certain limit, the annoyance-value is increased if silent in 
are introduced between the successive steps. 


e oth- 
oying 


ore 


tervals 


These results typify listeners’ responses to meaningless sounds 
When meaningful sounds like speech or music were used, the listeners 
refused to apply the word “annoyance” in describing them. Annoya 
did not seem to be a proper dimension of such sounds, but the listen 
were agreeable to calling the sounds “distracting.” Apparently, mean” 
ingful sounds have a higher ‘“‘attention-value” than meaningless sour” 3 

These experimental results, supplemented by results obtained wit? 
other types of sound, indicate that annoyance is related to three aspe® s 
of the sound. 


nce 
ers 


Loudness. The most important single factor in determining annoyance-judg- 


ments is the intensity of the sound. With sufficient intensity, any sound ee 
made annoying, and extremely loud sounds produce actual pain. Since tl 
variable is so fundamental to annoyance, care was taken to equate the intensit 
of the signals when other aspects were being studied. re. 
Pitch. In general, sounds having their energy concentrated among the highe 
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audible frequencies are more annoying than low-frequency noises (11, 16). In 
this respect, the frequency of the sound alters its annoyance-value in a manner 
Opposed to the effect on masking. With alow-frequency noise we cannot hear 
speech, but with a high-frequency noise we are more apt to be annoyed. 

_ Modulation of Loudness and Pitch. A third important factor is the modula- 
tion which the sound undergoes. Listeners report that they prefer to listen to 
continuous, unchanging sounds, and that a sound changing irregularly from 
moment to moment is more annoying than a sound which is changing regularly 
(12). Listeners feel that the distraction of a changing sound is less desirable 
than the boredom of a constant sound, and they retain this opinion even after 
many hours of articulation testing in the presence of different noises. Appar- 
ently the changes in loudness are more effective than changes in pitch, but the 
individual differences on this point are too conspicuous to permit a safe gen- 
eralization. 

_ These conclusions lead to the belief that what we are here consider- 
Ing as annoyance is a close relative to the problem of hedonic tone. 
Judgments of the pleasantness, indifference, and unpleasantness would 
Probably have led to very similar conclusions. Thus, while the results 
May be interesting as an exercise in experimental esthetics, the charac- 
ter of the problem has somehow been altered by the experimental 
approach. The principal concern, it will be recalled, is with annoyance 
as a hazard to vocal communication. On this score the results are 
Consistently negative, and at no point in the experimental results is 
there unequivocal evidence that the articulation scores obtained by 
trained listeners in the presence of an annoying sound were lower than 
the scores obtained in the presence of an indifferent sound which had 
the same acoustic spectrum. With the attitude adopted by listeners in 
the laboratory situation, annoyance is not a hazard to communication. 
And yet, sounds do differ in annoyance value, and annoyance or dis- 
traction does sometimes interrupt our verbal flow. Perhaps the most 
reasonable generalization, therefore, is that when a listener finds him- 


„Self in a situation where he is vulnerable to auditory annoyance, he is 


Most vulnerable to loud, high-pitched, unpredictable sounds. Just 
What situational and attitudinal factors contribute to his vulnerability, 
Owever, this research does not reveal. 


SUMMARY 


A wide variety of sounds have been investigated to determine the 
€xtent to which they interfere with vocal communication. The masking 
Of speech has been determined by articulation testing methods, and 
estimates of annoyance have been obtained by the method of paired 
Comparisons. 

The sounds are classified as noises, tones, and voices 
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types of sound, the stimulus-dimensions determining both masking and 
annoyance are the intensity, the frequency or spectrum, and the 
temporal pattern of the sound. Masking depends primarily on the 
speech-to-noise ratio over the range of frequencies involved in speech. 
Sounds of low frequency mask this range more effectively than sounds 
of high frequency. Interruptions in the sound decrease the masking 
effectiveness. 

Annoyance also increases as the intensity is raised, but low-fre- 
quency sounds are less annoying than high-frequency sounds, and inter- 
mittent, irregular sounds are more annoying than continuous sounds. 
There is no evidence, however, that annoyance interferes with vocal 
communications in the laboratory situation. 
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THE EFFECTIVE USE OF MANIPULATIVE 
TESTS IN INDUSTRY 


W. F. LONG AND C. H. LAWSHE, JR. 
Division of Education and Applied Psychology, Purdue University 


“The practical possibilities of psychological tests are now generally 
conceded, both by the professional psychologist and the industrial 
layman.... The immediate future is likely to see a very extensive 
application of tests to industry. . . .” These statements were made by 
Henry C. Link (35) in 1919. 

It is disturbing to realize that those words seem just as apropos today 
as they did twenty-seven years ago, and that there is little doubt that 
the use of tests in industry during the intervening period until now has 
not been as extensive as Link anticipated. What are the reasons for 
this slow development of industrial testing, and what are the possibili- 
ties of Link’s “immediate future” with the extensive application © 
tests to industry being close at hand? A review of pertinent literature 
should uncover some answers to these questions. 

Since most of the significant work in the field of industrial testing 
completed prior to about 1935 has been adequately summarized by 
others, notably Bingham (5), Viteles (50), and Garrett and Schneck 
(22), it is not necessary to include here reviews of work complete 
prior to that date. Obviously, even when that mass of data is elim- 
inated from review, the volume of literature on all phases of industria 
testing since 1935 is too vast for the present review; hence consideratio” 
is limited to the use of manipulative tests in business and industry’ 
which area is probably representative of the entire field. In additio”, 


the references reviewed are limited almost entirely to those which 


included objective validity data and, with the exception of a few reports 
of British investigations, only work done in this country is included. 
Certain related, general reviews and bibliographies of industrial testing 
by Tiffin (45), Sells (42), Zerga (52), and Benjamin (2), are recom- 


mended to those interested in a detailed coverage of the topic although 
none of them is included herein. 


EXTENT or UsE 
To substantiate the suggestion in the second paragraph above 
that the use of psychological testing in industry is not nearly as €x- 
tensive as it might be, certain reports can be noted. 
In commenting about Link’s state 
Taylor (43) noted in 1940 that the belie 
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testing in industry is bound to increase, has been hopefully voiced from 
time to time throughout the intervening period of time since Link’s 
article was published. He noted also that Hay (30) in the early part of 
1940 listed eight companies which to his knowledge were making 
Systematic and extensive use of tests in the selection of employees. 
Taylor then commented, “Granting that possibly two or three times 
as many other companies which did not happen to come to Mr. Hay’s 
attention may have test programs, the number carrying on such activi- 
ties certainly does not represent any sizable proportion of American in- 
dustry.” He suggested three reasons for the slow growth of testing 
Programs: over-enthusiasm on the part of exponents of testing, over- 
Skepticism by those unfamiliar with tests, and the expenditure of time 
and money required for the development of adequate industrial tests. 
It should be noted that Taylor did not include in his commentary a 
Part of Hay’s remarks which said, “Many more companies have partial 
Programs or are in process of developing a program, and it is probable 
that more serious attention is being given to psychological tests at the 
Present time than ever before. Nevertheless it is remarkable, consid- 
ering the success achieved by these companies, that every company in 
the country does not use psychological tests as a matter of course.” 
Some conflicting, but more convincing, testimony was given by 
°row (10) when he reported that a National Industrial Conference 
oard survey made in 1936 found that slightly more than seven per 
cent of 2,412 firms used psychological tests. Although this is a small 
Percentage of the total, the number is substantially larger than Hay 
r Taylor reported. Borow suggested in the same report that the 
number of companies using industrial tests in 1944 would dwarf the 
number listed in the National Industrial Conference Board survey, 
although no accurate figures were available to support his belief. 
$0. Judging on the basis of these few reports, there is no doubt that 
Mething is amiss in the use of psychological tests in industry, and 
at an explanation for this condition, which will at the same time sug- 
Sest remedial measures, is needed. 


SELECTION AND PLACEMENT 


en of the applications of testing in industry have been made ey 
as Nection with the selection and placement of workers, probably be- 
tor the most obvious use of tests is for those purposes, and beat 
of m fact that management and workers can be more readily convince! 
he value of tests for selection and placement than for other purposes. 
In discussing occupational differences in manipulative abilities, 
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Teegarden (44) said, “A battery of manipulative tests makes possible 
rating of job applicants on specific traits of manipulative performance; 
speed, accuracy, delicacy of control, single or two-handed manipulation, 
ability to follow demonstration or instruction, to react to details, or- 
ganize, maintain or increase speed, solve problems, etc.” 

He also reported that test performances of adults in 14 men’s and 
16 women’s occupations show that no two occupations present identical 
combinations of score levels on the tests used, and mentioned the obvi- 
ous conclusion that analysis of an applicant’s test scores indicates the 
type of occupations for which he seems best fitted. 

In their Summary of Manual and Mechanical Ability Tests, which 
includes objective descriptions of available mechanical aptitude tests 
and brief summaries of their application to various types of selection 
situations, Bennett and Cruikshank (3) wrote, “The assembly test oF 
apparatus test which may be a miniature replica of the job situation, i 
probably a very adequate test for job selection, particularly for semi- 
skilled workers. No doubt much more adequate measures can be de- 
vised in this form of test.” 

It would be impractical to attempt to consider the use of each tyP® 
of manipulative test separately, so a more logical division of test appli- 
cations into occupational groups has been made. 

Machine Operators. According to Hurt (32), in October of 1937 a 
garment factory in Marion, Virginia, started using four tests: Strengt 
of grip, speed of arm movement determined by the length of time re- 
quired to transfer 12 thimbles from one set of pegs to another and back 
again using one hand only, hand steadiness, which was scored by deter- 
mining in how small an opening in a metal plate the subject coul 
insert a stylus without making contact, and a general dexterity test. 
The latter test was considered to be the most valuable. It consisted O 
an inclined board with three equally spaced sockets in line at the front 
of. the board with a counting device behind each socket. The subject 
punched the stylus successively in each hole as rapidly as possible for 
one minute. The average score was 35 and it was found that a score © 
38 or better adequately predicted success of sewing machine operators: 
Although no quantitative validity data are presented, the author states 
that a few operators that were hired experimentally from the disquali- 
fied group demonstrated that they were not fitted for the job. 

Using a miniature punch press in testing 25 experienced punch- 
press operators, Tiffin and Greenly (47) found that speed on the test 
is closely related to both speed and accuracy ratings made by foremeD) 
the correlations being —.55 and .63 respectively. A high relationship 
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between accuracy on the test and a safety rating was also found to be 
present with a correlation of .60 reported. 

Bennett and Fear (4) administered a series of tests to machine tool 
Operators working on turret lathes, Bullard Automatics, precision 
grinders and milling machines at the plant of Martin and Schwartz, 
Inc. in Salisbury, Maryland. They used supervisor’s ratings as a 
criterion and obtained a correlation of .40 with performance on the 
Wisconsin miniature test for the engine lathe, .41 with a hand-eye co- 
ordination test consisting of a dummy drill press, and .46 with a hand- 
tool dexterity test involving the use of a wrench and screwdriver. 
Another approach to demonstrate the usefulness of the tests was taken 
by the authors in reporting that 91% of the men rated excellent re- 
ceived excellent or good total test scores, 75% of the men rated good 
received scores of excellent or good, and none of the men rated below 
average or poor received a score of excellent. 

A type of job-sample test used with sewing machine operators was 

Teported by Blum (7). In taking the test, the applicant operated a sew- 
ing machine performing two tasks: (1) following an irregular line and 
(2) sewing between two parallel though abruptly irregular lines. A cor- 
relation of —.42 between time required to complete the test and pro- 
duction records was obtained. He proposed certain critical scores for 
further use. If a critical score of 260 were used, 20% of the poor group 
and none of the good group would be eliminated; with a score of 240, 
28% of the poor group and 4% of the good would be eliminated; score of 
220, 56% of poor and 20% of good would be eliminated; and with 200, 
70% of poor and 28% of good would be eliminated. 
_ Anextension of the use of the Minnesota Rate of Manipulation Test 
1S reperted by Jurgensen (33). The Ziegler revision of the test was ad- 
ministered to a group of men hired to be converting machine operators 
Ma paper mill. The performance level on this test depends upon speed 
Of gross hand and arm movements. The men were divided into five 
8roups by each of three supervisors on the basis of speed of work. The 
Tatings thus obtained for each man were transformed into T-scores and 
the sum of the three scores for each man was used as a criterion. The 
Maximum multiple correlation obtained was .60 for a group of 60 men. 

Ross (38) found that when a critical score of 304 seconds on the 

Connor Finger Dexterity Test was used with a group of 41 machine- 
tool trainees, 90% of those rated “A,” 100% of the ‘‘B’s,” 85% of the 
C’s,” and none of the “‘D’s” or “E's” would have been selected. 

Andrews (1) reported the results of a year’s experience with selection 

tests for engineering operatives. A battery of seven tests including in- 
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telligence tests and five apparatus tests was administered to 122 
operators. Specifically, the apparatus tests included were a steadiness 
test, a finger dexterity test wherein small metal discs are placed as 
quickly as possible in recesses in a metal plate, a second dexterity test 
wherein pairs of screws under slight tension have to be unfastened and 
then fastened again, a bi-manual test involving simple coordinated 
movements of both hands, and a block sorting test made up of 100 
blocks, thirty-six of which have part of a pattern design missing. It 
was found that if the highest scoring half of the applicant group had 
been hired, 87% would have been satisfactory, rather than the 60% 
which were satisfactory when selected without the use of tests. 

Assembly Workers. Drake (16) used three tests, a pin board, a con- 
trolled turning test and a right-left turning test in experimental work 
for the Eagle Pencil Company. He found that those who scored above 
average on the test began training for a two-hand assembly job at 80% 
of normal worker efficiency and after two weeks training reached 97% 
efficiency. Those who scored below average on the tests began at 76% 
and ended at 91% of normal efficiency. 

In a small group of watch factory workers Candee and Blum (11) 
found a correlation of only .26 between foremen's ratings and O'Connor 
Finger Dexterity Test scores. There was a critical ratio of 2.18 betwee? 
mean scores on the test made by superior and mediocre workers. 

Experience with employee selection tests for electrical fixture as- 
semblers is reported by Tiffin and Greenly (46). In a group of 36 
workers who performed operations of burning, twisting, and soldering 
ends of insulated wire a correlation of .63 was found between foremen $ 
ratings of quality and scores on a hand precision test, and of —.16 be- 
tween test scores and production records. They also reported a ecrrela- 
tion of zero between ratings and finger dexterity test scores. For @ 
group of 33 workers whose job consisted of placing plug and socket, 
soldering tips on ends of wires, fastening the wires and assembling the 
plug and socket the following relationships were reported. A correlation 
of .22 was found between production records and dexterity test scores 
(with experience held constant), and of .49 with hand precision test 
scores. Using supervisor’s ratings of general efficiency as a criterion, 2 
validity of .33 was obtained for the dexterity test (with experience held 
constant), and of .42 for the hand precision test. For a group of 44 
operators whose work consisted of placing parts in a chassis and then 
connecting and soldering wires, the highest correlation found was only 
.27, that correlation being between dexterity scores and supervisor's 
ratings of efficiency (with experience held constant). 
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Blum (6) investigated the possibility of using the O'Connor Finger 
and Tweezer Dexterity Tests for selection of watch factory workers 
doing miniature assembly work. In a group of 137 workers, he found a 
correlation of .26 between finger dexterity time score and a salary ratio, 
.32 between tweezer dexterity time score and salary ratio, and .39 
between a combined finger-tweezer dexterity score and salary ratio. He 
alsoreported a significant difference between time scores made by workers 
with less than seven days of experience and scores of workers with more 
than one year of experience, the latter group taking less time as might 
be expected. 

Inspectors. Henshaw (31) used three performance tests with a group 
of 18 female paper inspectors. These girls inspected sheets of paper 
approximately 20X30 inches in size for weight and appearance by 
flipping the sheets over from one stack to another. All of the girls were 
25 years of age or under and had at least six months of experience on the 
Job. Using output figures as a criterion (reliability of .69 to .86) he 
Obtained a correlation of .70 with a hand-arm dexterity test in which 
the subject continuously tapped two 3" keys located one foot apart one 
after the other. A validity of .61 was found for a tactile discrimination 
test in which the subject sorted a stack including three grades of paper. 

Sing a choice-reaction test in which the subject reacted to one of three 
Stimulus lights by pressing the correct one of three buttons to extinguish 
the light, a correlation of .52 was found with output records. 

In studies of tests for the selection of inspector-packers, Ghiselli (23, 
24) found the following correlations between a combination of ratings of 
Supervisor and forelady: With a pegboard, —.50; with the Minnesota 

ate of Turning Test, —.40; and the Minnesota Placing Test, —.24. 
The duties of the group of 26 girls consisted of (1) filling capsules, vials, 
and bottles with serums and antitoxins, (2) examining the filled con- 
tainers for the presence of extraneous foreign matter, (3) labeling the 
Containers, and (4) cartoning and packaging them. P 

Other Industrial Workers. The validity of certain mechanical ability 

tests for selecting cotton mill machine fixers is reported by Harrell (27). 
tetrachoric correlation of .42 was obtained between scores made by 

5 loom fixers on a 15 minute adaption of the Stenquist Mechanical 
‘ssembly Test and proficiency ratings made by supervisors. A correla- 
tion of 84 between scores made on the same test by 40 fixers in a carding 
partment and ratings of mechanical ability made by an overseer was 
"€ported. The correlation between a composite of three ratings of 
Mechanical ability and test scores of 10 spinning frame fixers was found 
to be 78, It is interesting to note the substantial differences between 
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the reported correlations when the criterion used was a rating of me- 
chanical ability and when it was a rating of general job proficiency. 

Drake and Oleen (20) reported the development and use of eight 
apparatus tests and one paper test in connection with studies for the 
Eagle Pencil Company. A comparison of the earnings of four groups 
differentiated on the basis of test scores is of interest. A group of 13 
old employees with 32 months of experience who passed the test re- 
ceived wages in the amount of 116.1% of the average. A group of 10 
employees with 24 months of experience who failed the tests reached a 
wage level of 93.6% of the average. A group of sixteen new employees 
who passed the tests and had only two months experience reached 
97.4% of average level. A group of nine workers who passed the tests 
and received two weeks of training reached a level of 113.3% of average. 
They found that the cost of giving these tests was about two dollars 
per person which is less than half the difference in weekly wages between 
those passing the tests and those failing them. 

A battery of tests including tests of motor coordination, intelligence, 
and emotional attitude was administered to 2,246 applicants for semi- 
skilled industrial jobs according to Cleeton (12). From this number, 
849 applicants were selected for training, of which 546 successfully 
completed training. The only markedly significant difference betwee” 
median test scores for the group which completed training and the 
group which was disqualified for apparent lack of ability was found for 
the motor coordination test.. The median score for the first group was 
75.5 and for the second, 61.3. 

The use of certain undescribed machine and mechanical ability 
tests in the aviation industry is reported by Schultz (39). Of a group © 
17 men rated “first class” by foremen, 88% received satisfactory test 
scores. In a group of 50 men rated as “semi-skilled,” 88% also received 
satisfactory test scores. Only 42% of a group of 31 men rated as 
“bench-hands” received Satisfactory scores. A year later 25 of 27 men 
selected by tests were doing good or average work. 

Evans (21) found the median validity coefficient of a two-board, 
two-hand peg transfer test to be .34, The test was administered to 15 
samples employed at work involving the use of two hands. 

Cook (13) developed several tests on the basis of job analysis for the 
Western Electric Company. A coil winder test involving the task of 
winding wires in a prescribed manner around screws on a board differ- 
entiated a below average earning group from an above average group- 
In the above average group, 8% failed the test while in the below av- 
erage group 72% failed. With relay adjustors, he used a monotony 
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test in which the worker tapped through a hole to hit a lower plate 


` without hitting the sides of the hole in the top plate. In this group only 


9% of those above average in efficiency were below average on the test, 
while 75% of those below average in efficiency were also below average 
On the test. 

y Moore (37) reported the use of manipulative tests in several situa- 
tions. A Detroit company used the Detroit Assembly Test which con- 
Sists of two fairly large boxes of 80 one-inch wooden cubes and requires 
the subject to use both hands to transfer the cubes from one box to the 
Other at the same time packing the cubes neatly and in an orderly 
manner in the second box. When used in the selection of candy wrappers 
and packers, the company found that since the introduction of this test, 
96% of the workers were satisfactory as compared to 52% before the 
test was used. He reported the use by the United States Employment 
Service of a wooden modification of the O'Connor Finger Dexterity 
Test in which the subject is required to move two pegs at a time, one in 
each hand, to a second board. A correlation of .45 was found between 
Scores on this test and production records of 43 can packers. Moore also 
reported results of studies made by the United States Employment 

ervice of several occupations using the Minnesota Spatial Relations 
Test, The following correlations were obtained between the test scores 
and production records or other factors: for can packers, from .20 to .24; 
for Coding clerks, from .26 to .38; for calculating operators, from .32 to 
he, for card punch machine operators, from .31 to .55; and for power 
Sewing machine operators, from .40 to .50. 

As reported in Factory Management (53), the Ford Motor Company 
Used a hand dexterity test, a finger dexterity test, and a test wherein 
€ subject is required to determine the size of rivets by touch rather 
an by sight. Foremen agreed that workers selected by the use of 
Sts were superior to those selected by regular methods. In the same 
Publication (54) it was reported that the Woodward Governor Company 
Ockford, Illinois started using two tests of machine skill in 1938, 
in 1943 the management stated that they were convinced that the 
ly are of inestimable value to the organization, not only for their 
* lue in placement but in upgrading. In one of the tests described, the 

Ject controls a pencil by two cranks which turn in opposite directions 
TA at different speeds and is required to follow a pattern on paper. In 

€ other test described, the subject controls the path of a pencil by 

Wo Cranks, following lines on a revolving drum. Although the two 
“Ports in this paragraph contain only subjective validity data, the 
“sts and use made of them are especially interesting. 


te 


and 
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Crissey (14, 15) reported the use of tests in the AC Spark Plug ' 


Division of General Motors Corporation. A group of job-setters were 
given a number of paper and pencil tests, several well known apparatus 
tests, and two peg-board tests to check uni-lateral and bi-lateral 
dexterity. A validity of .65 for a selected battery is reported with 
supervisor’s ratings used as a criterion. It was further reported that 
69% of those who received test scores in the high third were also rated 
in the high third by supervisors, while none of those rated in the low 
third received scores in the high third of the group. A multiple correla- 
tion of .74 was found between a similar test battery and production 
records for aircraft spark plug gappers. The tests indicated that the 
most productive workers were above the average in bi-manual coordina- 
tion and visual perception. In the same group, 88% of those who scored 
in the high third on the tests were also in the high third based on pr°- 
duction records and all of those who had production records in the low 
third also received test scores in the low third. The tests also wer 
demonstrated to be of value in the reduction of worker turnover rate 
when it was found that turnover for personal reasons of male employee? 
for a one year period in a group pre-selected by tests was 5% while ina 
group not pre-selected the rate was 12%. He also presented one of the 
most convincing bits of evidence concerning the value of tests yet 
reported when he stated that as a consequence of hiring only those 
scoring in the high third on the battery of tests, the average productio" 
per operator was higher than the best previous individual record a” 
there was considerable improvement in worker morale on the job. 

As reported in Occupations (55), a battery of sixteen tests was ad- 
ministered to 51 aircraft riveter trainees. Those trainees whose tot 
test scores were in the upper third of the total group correctly set 26/0 
mone within a set period of time than those whose total test score? 
were in the lower third of the total group. The probability that such ® 
difference would occur by chance is less than 1 in 1,000. Using the same 
timed work sample as a criterion, a multiple correlation of .60 was 0?” 
tained for a battery of three tests which included a Worker-Analys 
Pegboard Apparatus Part I, a Worker-Analysis Finger Dexterity Test 
Part II, and a figure-copying test. 

Department Store Workers. Blum and Candee (8) investigated the 
possibility of using the Minnesota Placing Test, the Minnesota Turning 
Test and the O'Connor Finger Dexterity Test for selection of depart” 
ment store packers and wrappers. A multiple correlation of -38 W2 
found between production records of a group of fifty-two seasonal em” 
ployees and test scores while a similar correlation for permanent 
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workers was only .24. Efficiency on the job seemed to be the result of 
experience which was borne out by the fact that experienced workers 
Scored significantly higher than seasonal workers, and that no difference 
in test scores was found between satisfactory and less satisfactory 
Permanent employees. The authors suggested that these tests should 
not be used in selecting permanent employees. 

In a later study, the same authors (9) found that manual dexterity 
as measured by the O’Connor Finger Dexterity and the Ziegler Placing 
Tests is not a selective factor for department store wrappers. With 
Packers, a slight difference was found between the scores of experienced 
it inexperienced workers but this difference disappears with experi- 

nce, 

Ghiselli (25) found in checking the results reported by Blum and 
Candee that two of the same three tests plus a finger dexterity test 
would not be satisfactory as selection devices for package wrappers. 

n a group of 42 seasonal wrappers, the correlation of supervisor ratings 
with Minnesota Placing Test scores was —.10, with Minnesota Turning 
Test Scores —.02, and with the finger dexterity test scores .02. He 
Suggested that if motor and dexterity tests are to be used as selective 
devices for this type of worker, they should be different in nature from 
the three used. 


ACCIDENT PRONENESS 


In beginning a study on the prediction and control of accidents, 
Drake (18) found that only moderate relationships had been reported 
tween accident records and test scores when the tests had been 
andled in the conventional manner. He stated, “Far more significant 
relationships were found when the differences between scores on per- 
CePtual and motor tests were compared with an index that took ac- 
Count of both frequency and severity of accidents.” The results ob- 
tained lead him to propose the hypothesis that accident proneness is à 
Phenomenon associated with discrepancies in level between perception 
and motor reaction. It was observed that persons whose perceptual 
ae is equal to or higher than their motor level are relatively safe, 
a ile those whase perceptual level is lower than their motor level are 
Fa prone, with records of more frequent and more severe ac- 
€nts than the former group. According to Drake, this implies that 
thee who can see faster than they can react are relatively safe, while 
Ose who react faster than they can see are accident-prone. It is ap- 
parently possible however for the subjects studied to have a variety of 
Corrected defects of vision and still get perceptual clues that were 
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quite adequate for effective and safe behavior. Drake noted that 1n 
order to make comparisons between perceptual and motor levels it was 
necessary to use tests that yield measures of the function intended and 
that are as free as possible from other complicating factors. In support 
of his contention that there is a marked tendency for difference scores to 
pick out workers with the high accident frequencies and severities, 
Drake offered as substantiating data, a comparison between accident 
indexes and difference scores. It is sufficient here to note that the half 
of a group of industrial workers that had the best difference scores ha 
an accident index of 7.71 while the low half had an accident index of 
25.04. He also reported that in one new group of employees selected on 
the basis of test and difference scores, accident reduction was 10% of 
the rate of workers selected by previous methods. 

Williams (51) reported the use of a series of 15 simple tests in a study 
of accident prevention. The series included an interrupted pursuit test, 
a hand-eye coordination test involving dotting of small circles 0? a 
revolving disk moving at an increasing rate of speed, reaction to visua 
or auditory-tactual stimulus by pressing correct button, muscular 
hand-arm steadiness, and a hand dynamometer. Those scoring in the 
lowest quartile had an average accident history of .73 accidents per 
man per year while the average accident rate of the remaining 15% wae 
only .37 per man per year. 


TRAINING 


An interesting approach to the value of testing devices in training 
programs is reported by Tiffin and Lawshe (48). They found that 
hosiery mill employees with the poorest finger dexterity as measured 
the Purdue Peg-Board cost a company $59 each in minimum make-uP 
before they made the rate, while employees having the best dexterity 
cost the company only $36.40. They suggest that tests can answer three 
questions about training: (1). Who should be trained? (2). Where 
should training begin? (3). Has training been adequate? 

Martin (36) reported the use of a battery of written and apparatus 
tests by the Woodward Governor Company. He stated that the com- 
pany hadn’t lost a member of the organization because of technica 
inability during the year since the tests were begun and that aS 2 
measure for weeding out the untrainable, the tests had worked out just 
about 85%. 

Knowles (34) reported the experience of Northwest Airlines in using 
a battery of tests with general mechanics. The battery included one 
written test designed to measure ability to learn, a carefulness test in 
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which the subject is given a pile of metal pieces to assemble into units 
exactly like models, and a simple assembly test in which the subject 
assembles rows of bolts in a plate alternately from front and back while 
the plate is shielded from his view. In a group of 100 men hired for 
training, 25 were rated good, 50 fair, and 25 poor. \If the 50 scoring 
highest on the test had been chosen, 38% would be in the good group, 
58% fair, and only 4% poor. If the top 25 had been hired, 56% would 
have been in the good group, 44% fair, and none poor. 


DEVELOPMENT OF TEST PROGRAMS 


One of the striking impressions gained through reading published 
Teports of industrial application of tests is the lack of ingenuity evi- 
denced by many, if not most, investigators. The most successful test 
Programs that have been reported included tests that were designed 
€specially for specific situations. As could readily be discerned, how- 
€ver, the common procedure in too many instances was to get together 
a large group of tests that had worked in other places and administer 
them, hoping that some would prove to be of value. It alsoseemed that 
Many times tests were administered and then the most expedient 
Criterion was selected. Not disregarding certain advantages of standard- 
ized and ready-made tests, it seems painfully obvious that low validities 
in many situations were due to “shot-gun”’ test applications and care- 
less selection of criteria. The resultant apparent limited value of many 
testing programs doesn’t prove that tests are useless, but patently sug- 
8ests that new tests should be developed and that any standardized 
tests used should be selected carefully on the basis of job analysis and 
Possibly factor analyses, and that criteria should be chosen just as 
Carefuily, 

Improvement of Criteria. In reviewing the literature the authors 
Simply gained support for their own contention that one of the major 
Problems in the effective use of tests in industry is the difficulty ‘in 
obtaining adequate criteria. About eighty percent of the articles avail- 

e for review were only generally descriptive and included no ac- 
;.Ptable validity data. When given, validity data were often expressed 
A subjective terms, the usefulness thus being limited. Since all valida- 
a of tests, whether the methods used are simple or statistically 

©mplicated, depends on the availability and selection of adequate 
Titeria, the first step in any development program must be the isola- 
an of suitable criteria. To be most effective, a criterion must be 
‘liable, relevant, and free from bias. In order to obtain criteria that 
Meet these conditions, it is quite probable that new record systems of 
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production, employee ratings, absenteeism, accidents, tenure, training, 
and the like will have to be installed in many companies. 

Drake (17, 19) suggested that all conventional criteria be discarded 
and proposed that test results be used as the criterion against which to 
measure the success of management, particularly the success of super- 
visors in releasing the measured abilities of workers in production. 
Apparently he found himself forced to make this proposal when he 
found that foremen’s rankings, output figures, and efficiency ratings by 
time study techniques were proved to be unreliable as measures of 
operator ability on the job. This break with generally accepted pro- 
cedure apparently ignores such considerations as motivation which 
would certainly be operating if the applicant felt he was setting his own 
performance level at the time of testing. Furthermore, a reliable 
criterion is only half of the solution since validity is at least equally as 
important as reliability. 

Job Analysis. The necessity for the use of job analysis in develop- 
ing a testing program for selection of mechanical workers was pointed 
out by Moore (37) when he wrote: 

The level and type of mechanical ability that is needed depends upon job 
demands, and varies from one type of job to another. An assumption that 15 
frequently made, and that has been as often disproved, is that there is such @ 
unit as general mechanical ability. Mechanical ability can only be appraise 
when the job demands have been analyzed, interpreted, and converted into 
bodily elements. Therefore the selection of tests with which experimental wor 
is to be carried on is determined by a careful analysis into human demands, 4 
grouping of these demands in terms of functional similarities, and a preliminary 


selection of those tests which have been found valuable in appraising the same 
functions in similar situations. 


Drake has written a number of articles in which he advocates the 
use of job analysis as the best possible basis for test development. In 
easly studies reported with Oleen (20) the following job factors were 
considered useful in designing tests: 

. Length of cycle, 

. Nature of the elements of the cycle, 

. Sizes of materials and parts, 

. Serial order of elements of the cycle, 
Three-dimensional position of parts manipulated, 


Incidence of finger, wrist, arm, and body movements, 
Posture of the operator, 


Visual, tactual, and kinesthetic attentive factors, 
Speed and rhythm of work. 


SEN Ane N He 


When working with the Johnson and Johnson Company at New 
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Brunswick, New Jersey, to find suitable ways to measure observed 
differences in ability used on the job, Drake (17) found reason to com- 
ment, “We employed the methods then common to industrial psychol- 
ogy and still, I regret to say, too common in this field. We applied 
batteries of paper and pencil tests . . . and tried to find significant cor- 
relations between tests and various measures of success on the job such 
as foremen’s ranks, production percentage efficiency, etc. This older 
Procedure was cumbersome and costly and was characterized by trial 
and error to the extent that made it seem unscientific and illogical in the 
extreme.” From the techniques developed in this study using time 
Study analyses of jobs and groups of related jobs, six types of measur- 
able human abilities were isolated. They were: 
General finger, hand, wrist, arm dexterity, 
Dual hand dexterity, 
Bilateral hand dexterity, 
Hand and foot coordination, 
Machine tending ability, 
Inspection ability. 
Several performance tests were designed which had obvious similarities 
with the jobs studied and Drake reported that when these tests were 
administered again to the same operators after a lapse of time, suffi- 
Ciently similar scores were obtained to indicate that the tests measured 
Something relatively unchanging. = p 
Cook (13) reported that he did not use detailed job analyses in 
Selecting tests for his first work with the Western Electric Company, 
but followed the procedure of administering as many as 14 or 15 tests 
to the workers. He stated that the usual results were that two or three 
of the, tests would prove to be of selective value. This procedure was 
Considered to be wasteful of time and effort and consequently other 
Procedures were investigated. Ultimately job analyses were developed 
Which included nine factors as a basis for test selection. The factars 
eluded were intelligence, eye and hand coordination, finger dexterity, 
Manual dexterity, small-tool dexterity, repetitiveness of work, ac- 
curacy, range of observation, and visual memory. Cook wrote that these 
actors were selected because it was felt that adequate tests were avail- 
able for measuring an applicant's performance on any and all of them. 
€st it be overlooked, it should be pointed out that the selection of job 
analysis factors on the basis of available tests makes use of only a por- 
ton of the potential value of the job analysis results to a test develop- 
ment program. 
It seems probable that many psycholo! 


Qube 


gists working in industry who 
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have attempted to use job analyses in test development have not used 
them in the best possible manner. There has been too much of a 
tendency to designate abilities differentiated on the basis of superficial 
descriptions of the job or of worker abilities required to perform effi- 
ciently on the job. More thorough job analyses including the use of 
motion study to identify necessary bodily performance skills, and the 
identification of significant psychological traits and abilities such as 
visualization, numerical facility, and perceptual speed, should afford 
better bases for the selection and development of tests. 

Factor Analysis. The use of factor analysis as a means of develop- 
ing tests for industry is a forward step still to be used to the best pos- 
sible advantage. Even this method cannot be utilized to its full po- 
tentiality, however, until more and better tests are included in the 
analyses. 

Harrell (28, 29) reported a factor analysis of mechanical ability 
tests which included thirty variables, eleven of them apparatus tests 
including three pinboards, a pegboard, a peg sorting test, two tests of 
manipulating nuts and bolts, Crocket’s block packing test, a test 10- 
volving the placing of blocks along strips, a nut, bolt, and screwdriver 
manipulative test, and the wiggly blocks test. The tests were admin- 
istered to a group of ninety-one cotton mill machine fixers. He found 
five factors which he named perception of detail (P), verbal relations 
(V), visualizing spatial relations (S), manual agility (A), and youth (Y). 
The tests which appeared in the agility factor were pinboard with both 
hands, two of the nut and bolt manipulation tests, block packing, P1” 
board with non-preferred hand, wiggly blocks, and a paper and penc! 
dotting test. Harrell noted that the P, V, and S factors have been 
previously identified by Thurstone and others. He suggested tha* since 
certain of the group paper and pencil tests measure each of the factors 
present in the manual tests, other paper tests can probably be devised 
tc replace all of the manipulative tests. Considering the list of tests 
included by Harrell in the analysis, it seems highly improbable that 
much attention was paid to job analysis and that no attempt was made 
to develop tests especially to measure the abilities or aptitudes needed 
by machine fixers. He probably is correct when he suggests that paper 
and pencil tests could be devised to measure the abilities measured bY 
manipulative tests that were included in the battery, but it is quite 
possible that better and more apropos manipulative tests could be 
developed that paper tests could not easily replace. It would also seem 
appropriate to question the necessity for the inclusion of tests for the 
measurement of mechanical abilities which are highly weighted with 2 
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verbal factor, if equally valid manipulative tests could be developed | 
that exclude a verbal factor. 

Seashore (40) found that motor skill tests isolated into groups by 
factor analysis usually involved a similar pattern of movement regard- 
less of musculature or sense-field required in subject performance which 
1s a functional rather than an anatomical grouping of the skills. Later, 
Seashore in conjunction with Buxton and McCollum (41) suggested 
that the next step in identifying statistically located factors would be to 
use the technique of photographic motion study and simple introspec- 
tion by the subjects rather than the all-too-frequent second hand 
analysis or vicarious introspection of the experimenters. They further 
Suggested that any one physiological limit, such as speed or muscle 
contraction, is usually important only for a given work method and 
that changing to another work method may either partially or entirely 
Overcome this limitation. This suggestion serves as one explanation of 
the success of applications of specially designed tests and the relative 
lack of success when shot-gun application of tests is made. 

Guilford (26) makes some valuable suggestions in pointing out the 
results of certain factorial studies. He wrote: 

Buxton’s recent preliminary analysis tends to show that there are common 
Motor factors. As Seashore points out, however, intercorrelations among motor 
tests are notoriously low and clusters are limited in scope. This fact forecasts 
a number of motor abilities of narrow range and tests with relatively low com- 
munalities. General factors of broad scope among motor abilities may include 
Physical strength, agility, and steadiness of control. 


The application of factor analysis to test development in industry is 
Still rare, probably because the pressure to produce something im- 
Mediately applicable does not often permit the use of a relatively long 
range program of factor analyses involving an accumulation of the 
Necessary amount of intercorrelational data regarding tests and criteria. 

ith sufficient data, the factorial content of a criterion can be deter- 
Mined and thus the criterion becomes a much more meaningful target 
Or test development. One of the more obvious uses of the factorial 
8pproach to test development is to attempt to design pure tests to 
Measure those unique factors that are demonstrated to be present in the 
Criterion. As another result of factor analysis, the value of each test in 
Predicting the known factors can be determined, thus permitting a close 
‘stimate of the validity of a test and the avoidance of the possibility of 
including more than one test in a battery which actually measure sub- 
Stantially the same thing. It is also possible that a particular test may 

€ substantially loaded with a factor found to be related to success on a 
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job even though the test task has little apparent relation to job Se 
formance. The value of such a test probably would not be fully realize 
unless made evident as the result of factor analysis. It thus seems clear 
that factor analysis can assist considerably in making an industrial test 
research and development program effective. 


CONCLUSIONS 


When using published material as a basis for judging the effective- 
ness of the use of psychological tests in business and industry, it must 
be remembered that many companies do not publish the results of their 
testing programs and it is quite probable that much successful and un- 
successful work remains unknown to all but a few. Nevertheless, a 
consideration of the published accounts of work done with manipulative 
tests would seem to indicate several conclusions in answer to the ques- 
tions raised in the introduction of this review. 


1. There is no doubt that testing programs can be most effective when os 
psychologists responsible for those programs are thoroughly familiar with 1 
dustrial problems, D d 

2. In many instances it may be necessary to redesign employee rating an 
record keeping systems to afford adequate criteria. Š 

3. The effective combination and use of correct job and factor analyses oe 
test selection and development will contribute greatly to the rapid developmen 
of extensive and effective psychological testing programs in industry. g 

4. The practice of reporting test results in other than technical terminology 
will probably contribute to the more ready acceptance of psychological testing 
programs by industrial management and workers. be 

5. Some of the most valuable applications of tests in industry may well 
for purposes other than selection and placement of workers. 


If the above suggestions are incorporated into psychological testing 
programs, there should be a much more į 


rapid expansion of effective 
programs in business and industry than has been apparent to date. 
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THE TREATMENT OF QUALITATIVE DATA BY 
“SCALE ANALYSIS” 


LEON FESTINGER 
Research Center for Group Dynamics, Massachusetts Institute of Technology 


P The basic problem of measurement in connection with measuring 
Instruments such as questionnaires and interviews is to arrange a num- 
ber of individuals in rank order from high to low on the basis of their 
Tesponses to a series of questions. The solution of this measurement 
Problem, that is, being able to say that one individual is higher, lower, 
Or equal to another individual, along some specified dimension, has been 
attempted in a variety of ways. Dependence upon expert ratings, ex- 
amination of intercorrelations among items and correlations between 
individual items and total scale scores are some of the methods which 
have been used to facilitate scale construction. 

During the war Dr. Guttman, together with others in the Research 
Branch of the Information and Education Division of the War Depart- 
ment, developed a new technique for analyzing qualitative data which 
aed have called “scale analysis” (2, 3, 4, 12). This technique attempts 

O solve some of the problems of scale construction which have up to 
now not been adequately handled. 

There have as yet been relatively few publications concerning ‘‘scale 
ae The technique has, however, received quite a bit of favorable 
econ from social scientists. McNemar (7), for example, in a recent 

icle make the following statements: ’ 
Th he next step would be to construct a uni-dimensional scale for mea N 
3 of several components. Use of the Guttman technique would greatly : 

te the Meeting of this requirement.... A Guttman-type scale for each com 


peat would also avoid the absurdity of having several questions, which sup- 
fon tap a given component turn out... to have little or nothing in com- 


sional; we have stressed the basic need for reliability, validity, arg E 
with. ity for the instruments or devices used to classify or measure indi Joped 
respect to their opinions or attitudes. - - - Unitary scales can be developed 

the Guttman scaling technique. 


It is the purpose of this paper to review the published literature and 


S “scale 
oe unpublished material with reference to the theory of ee 4 
halysis,” techniques of scale construction using “scale analysis, @ 


n 3 hod. 
€ evaluation and interpretation of the scales achieved by this Ta 
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THE THEORY oF “SCALE ANALYSIS” 


The meaning of uni-dimensionality. When an investigator attempts 
to measure a given variable (e.g. attitudes toward Russia, intelligence, 
weight) he assumes, at least implicitly, that he is dealing with some- 
thing which has a unitary character. That is, that he can obtain meas- 
urements which relate unequivocally to this variable. 

If the concept in question, or if the measuring instrument uscd, is 
not of this type, then there will unavoidably be some equivocation in 
the use of the measurement scale derived. To take a very simple il- 
lustration, suppose we were measuring individuals with an instrument 
(e.g., a series of questions) which simultaneously measured prejudice 
toward Negroes and mathematical ability. Two individuals who re- 
ceived the same score on this measuring instrument need not at all be 
alike. One could be more prejudiced and the other more able mathe- 
matically. Similarly, if one individual gets a higher score than another 
individual, we would be uncertain to what to attribute the difference: 
The difference might be one of degree or it might be a difference in kind. 

In any measuring instrument which is not uni-dimensional, that 15 
which does not measure one and only one thing, the ordering of in- 
dividuals by virtue of their scores does not have the simple quantitative 
Properties which we desire in a scale. On an intelligence test, for example 
two individuals may achieve the identical score in very different ways- 
One individual might have good mathematical ability and another in- 
dividual good verbal ability. These two quite different individuals 
nevertheless may come out with the same total score. In what sense 
then can it be said that the two are equal in intelligence or in what sense 
can it be said that a person who gets a higher score has more intelligence 
than a person who gets a lower score? The differences which we obtain 
between individuals may be differences in type of intelligence or tyP® 
of ability, in addition to differences in the degree to which they possess 
these abilities. 

The conclusion should not, of course, be drawn that the scores 07 
intelligence tests, or the scores on any other measuring instruments 
which does not possess uni-dimensionality, have no meaning. On the 
contrary, they can still be useful and have a good deal of meaning (1): 
It can be said, however, that if such a test could be split into sever 
components, each of which did possess uni-dimensionality, then the 
value of this instrument would be greater than the value of the previous 
test. 

Where it is feasible, the attempt should be made to have our meas” 
uring instruments measure only one dimension or one variable at a time’ 
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Even if this is not possible, it is obviously desirable that the investigator 
have some indication of the extent to which his measuring instrument 
departs from the ideal of uni-dimensionality. 

The determination of uni-dimensionality. The problem still remains 
of determining whether or not a given measuring instrument possesses 
uni-dimensionality. Let us examine the characteristics which such a 
measuring instrument would have. 

It is implied by the preceding discussion that a uni-dimensional 
Measuring instrument would be such that a given score on that in- 
Strument could be obtained only from one pattern of responses. In 
addition, for these scores to represent an ordering of individuals with 
respect to some variable there would have to be a certain type of con- 
Sistency among the responses to the various items making up the 
measuring instrument (4, 11, 12). 

Let us examine what the data would look like if secured with a 
Measuring instrument known to be uni-dimensional. Suppose that in 
Order to measure the height of individuals a measuring instrument were 
chosen which consisted of ten sticks, each one of different length. The 
Operation of measurement is to stand each stick up alongside each 
individual and to record whether he is taller or shorter than the stick. 

€ record a plus if he is taller and a minus if he is shorter. Our data for 
each individual then consist of ten marks, each of which may be either 
plus or minus, 

We might now determine how many individuals received a plus on 
“ach of the measuring sticks. We could then arrange our sticks in order 
rom the one where the greatest number of people received a plus down 
to that stick where the smallest number of people received a plus. 

_ We would now find that any individual who received a minus on 
Stick “1” Would have received minuses on all of the other sticks; an 
individual who received a plus on stick “1” and a minus on stick “2 
Would have received minuses on all of the subsequent sticks and so on. 

e individual who had received a plus on stick “10” would have 
1 ceived pluses on all of the other sticks. Each individual would thus 
ave fallen into any one of 11 “patterns of response.” These patterns 
“ould be arranged in order from “‘tallest”’ to “shortest” and we would 
St US have a scale for measuring height. It may be noted that since this in- 
i tument was uni-dimensional, it would have been impossible for ba 
"dividual, for example, to have received a minus on stick “1,” a p. nd 
we Stick “2, and a minus on stick “3.” In fact, there are 1013 possible 

Patterns of response” which would not have occurred. 


F : to 
f our ten measuring sticks were ten questions which attempted 
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determine, for example, attitudes toward labor, the same type of con- 
sistency would evidence itself if uni-dimensionality were present. s 

If then, after having collected data, we find them to present this 
type of consistency, we may conclude that our measuring instrument 
possesses uni-dimensionality. It should be emphasized that whether a 
given set of questions (a given measuring instrument) does or does not 
| possess uni-dimensionality is entirely a matter for empirical determina- 
tion. 


THE TECHNIQUE OF SCALE ANALYSIS 


The Scalogram Board. The scalogram board technique for executing 
a “scale analysis” was devised by Guttman (4). It rests upon the fol- 
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Fic. 1. HYPOTHETICAL EXAMPLE OF A 
SCALOGRAM 


lowing considerations. If hypothetical data from our previous example 
concerning the ten measuring sticks were arranged in tabular form, each 
individual representing a row and each column representing a response 
to a measuring stick, the resultant pattern would be as shown in Fig. t 
It can be shown that for any measuring instrument which is um 
dimensional the rows and columns can be arranged so as to obtain such 
a “parallelogram.” If the measuring instrument does not possess unt 
dimensionality, then there will be deviations from this “parallelogram: 
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The scalogram board is constructed so that any row or any column 
can be lifted and moved to another position. The procedure of analysis 
is then to indicate in the appropriate column and appropriate row of the 
board whether the individual did or did not make that particular 
response. Then, by shifting rows and columns one can attempt to 
arrange the board so as to approach as closely as possible to the ideal 

parallelogram.” If the data can be made to form a parallelogram 
One can conclude that the measuring instrument possesses uni- 
dimensionality. The scale patterns conforming to the various scale 
Positions become apparent from the resulting diagram. 

The number of scale positions is, of course, determined by the num- 
ber of questions in the measuring instrument and the number of pos- 
Sible responses to each question. In the present example where there 
are ten questions each of which has two categories of response there 
Would be 11 possible scale positions. In general, the number of possible 
Scale types is given as follows (10): “Add unity to the total number of 
Categories in all questions and subtract the number of questions.” In 
other words, if our measuring instrument were made up of seven ques- 


tions, one of which had two categories, four of which had three cate- 


Bories, and two of which had four categories, the number of possible 


Scale types would be sixteen. i i 
The scalogram board has been used extensively by the Information 


ie Education Division of the War Department. There are several 
imitations, however, to its use. 


1. A scalogram board is fairly expensive to construct. 
2. The number of rows in the board must be large enough to accommodate 
the number of cases in the sample with which the investigator is dealing. If a 
ard is built which will handle 100 cases it will not be adaptable to any sample 
arger than this, The limitation on the number of questions and number of cate- 
ries in each question because of the number of columns in the scalogram 
Sard is of the same nature. j f 
3. The method is not very rigorous. Outside of possible hunches which the - 
€stigator may have concerning how the questions will scale themselves, the 
Method is largely one of trial and error and inspection. : i 
ch Since by the very nature of the method, the investigator manipulates 
Rance deviations to his advantage in trying to form a “parallelogram,” an 
uation of the resultant scale is somewhat difficult to make. 


iny 


$ B he Tabulation Technique. The use of this method for “scale anal- 
ysis” is described by Goodenough (2). It is probably the simplest 
Method available. The steps are as follows: 


1. One computes the number of people in the sampl 
each question. 


e making each response 
o 
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2. Each question is then graphically represented by a horizontal bar with 
proportional areas marked off indicating the number of people making each 
response. These bars for each question are placed directly under one another. 
Fig. 2 is a reproduction of such a diagram (10) based upon seven questions con- 
cerning the desire of enlisted men to return to full time school. 

3. Vertical lines are then drawn through all of the bars at the points separat- 
ing categories of response in each question. The responses between any pair of 
these vertical lines indicate one of the “consistent” patterns of response. 

4. The investigator then goes back to his data to see how well the response 
patterns of the individuals in his sample fit these scale types. 


This procedure is more rigorous than using a scalogram board. It is 
apparent, however, that in order to use this procedure it is necessary to 
know beforehand the directions of the responses to each question with 
regard to the dimension being measured. This means that in order to 
use this tabulation technique it is necessary to know, for example, that 
the four possible responses on Question 6 in Fig. 2 go in the direction 
indicated in the diagram with respect to desire to return to full time 
school. This is generally not too serious an obstacle. 

The originators of ‘‘scale analysis” propose that if one finds that the 
individuals in a sample do not adequately fit the scale patterns One 
may then examine the data to see if some of the responses to a question 
may be combined to make the fit more adequate. Thus, for example, aR 
investigator might find that by combining the categories of “undecided 
and “yes” on a given question his data might better approximate the 
scale patterns. The limitation to this is, of course, that again the 10- 
vestigator takes advantage of chance variation and evaluation becomes 
somewhat difficult. 

The Cornell Technique. This technique is so named because it has 
been used at Cornell University. It has been called the “trial scoring 
and graphic technique” by Goodenough (2). It is described in detail by 
Guttman (6). It consists essentially of making a trial rank order ° 
individuals and then examining to what extent the responses show the 
desired type of consistency according to this rank order. Consequent 
rearrangements in order and combinations of categories are made 8? 
as to approximate better and better to the type of consistency expect 
from a uni-dimensional scale. The investigator can then evaluate tO 
what extent the responses fit the scale patterns. The technique rests 
on essentially the same principles as the tabulation technique discuss¢ 
above. It has relatively little to recommend it as compared to the 
tabulation technique. It is more cumbersome and somewhat less 
rigorous. Its advantage would seem to be that it preserves somewhat 
greater flexibility for manipulating the items, 
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It may be noted that this procedure has been adapted for use with 
IBM equipment by Noland (8, 9). 

Least Squares Method. This is another method which was devised 
by Guttman (3). We shall do no more than mention its existence here 
since, as Goodenough (2) says, “This method is too laborious to be 
usable when one is dealing with more than a few items and categories.” 


CRITERIA For DETERMINING THE EXISTENCE 
Or Un1-DIMENSIONALITY 


Whichever of the above procedures is used for constructing a scale, 
the final problem which the investigator must face is that of making a 
decision as to whether or not his measuring instrument possesses uni- 
dimensionality. In other words, how well do his data fit those condi- 
tions which would exist if uni-dimensionality were present? 

For the purpose of making a judgment concerning the scalability 
of Stoups of items, the authors of ‘scale analysis” have introduced the 
Concept of reproducibility. From the statements in the published 
articles, it is somewhat difficult to understand just what is meant. The 
following are examples of statements made about it: 

Perfect scales are not found in practice. In the past, areas the component 
Items of which were at least 85% reproducible from a rank order have been 
called scales. Recent work shows that it may be desirable to be more stringent 
Shut errors and to restrict the word scale to areas the items of which are 

out 90% reproducible (12): , 
rep The degree of approximation to perfection is measured by a coefficient of 
roducibility which is the empirical relative frequency with which values of 
ne attributes do correspond to intervals of a scale variable. In practice, 85% 


Perfect scales or better have been used as efficient approximations to perfect 
Scales (4), 


From these statements one would conclude that the reproducibility 
Would be measured as follows: The responses of an individual which 
haactly fit one of the scale patterns would be 100% reproducible from 

S Scale score, Responses of an individual which deviated by 1 from 
Pa Of the scale patterns would be 100(m—1)/n percent reproduci b 
> ere 2 is the number of questions being scaled. Thus, if in a sample o 
ex. undred individuals who had each answered five questions, a 
à actly fitted the scale patterns, forty were 1 off a scale pattern an the 

“St were 2 off a scale pattern, the reproducibility would be 88%. 


du = Goodenough (2) tends to give a different impression of what is meant or 
Proility. One gets the impression from this article that what is meant by ak 2 
obyitibility is 859% of the individuals falling exactly in the scale patterns. 


Susly not what is meant by other authors. 
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In an example of a set of items which are considered to show a high 
degree of scalability one reads, 

There are but 16 possible scale types out of a total of 2,592 possible types. 
Actually almost 400 types occurred among the 2,000 men. Over one-fourth of 
the men were perfect scale types, and almost all the rest were one or two re- 
sponses off a perfect scale type (10). 

Guttman (4, 11, 12) has also made the distinction between scales 
and quasi-scales, the latter being measuring instruments which are 
less than 85% reproducible. As the subsequent discussion will show, 
this is entirely an arbitrary distinction. It would, perhaps, be better 
not to speak of uni-dimensional scales (except in very rare instances) 
and to content oneself with a description of the extent to which the 
scale on hand departs from the ideal of uni-dimensionality. 

Let us now examine how adequate a criterion 85% or 90% re- 
producibility is for deciding whether or not only one dimension is 
present. Let us take an example of five questions, each of which requires 
a true or false answer. We are to determine whether or not these five 
questions can be scaled, that is, whether uni-dimensionality is present. 
To make the example more specific let us suppose that to Question 1, 
80% of the sample answered false; to Question 2, 60% responded false; 
to Question 3, 50% responded false; to Question 4, 40% responde 
false; and to Question 5, 20% responded false. The number of possible 
response patterns to these five questions is 32. Only six of these, how- 
ever, would be “scale patterns.” If false indicated the same direction © 
information or attitude on all questions, these six scale patterns woul 
be (1) FFFFF, (2) FFFFT, (3) FFFTT, (4) FFTTT, (5) FTTTT, (6) 
DECE. If uni-dimensionality exists only these six would appear 1” 
the data. 

From this information alone, namely, the fact that there are six 
scale patterns out of 32 possible patterns, one could not very we 
estimate the probabilities of obtaining these six patterns. Let us cal- 
culate the chance Probability of occurrence of the six scale patterns 
(holding marginal totals constant), assuming complete independence 
among the questions. Since 80% answered false on Question 1, let us 
consider that “F” on Question 1 hasa probability of .8 and that Te 
on Question 1 has a probability of .2. Let us consider the probabilities 
of false and true responses on other questions in a similar manner. We 
then come to the conclusion that by chance 42.2% of the individuals 
would fit the scale patterns exactly. We also come to the conclusio” 
that 48.4% of the individuals would only be 1 off a scale pattern and 
9.4% would be 2 off a scale pattern. The reproducibility that one woul 


\ 
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ee chance would be 86%. It is clear then that for this specific 
za T a criterion of 85% reproducibility would be a very poor one to 
Let us examine what the chance reproducibility would be if we had 
at e TRE Falbe items that we were to make into a scale. Let us: 
Ei. > wee proportion of “F” answers to the questions are in 
eile a , 6, .5, .4, .3, .2,.1. There would now be 2° or 512 possible 
Pa ve: terns. Only 10 of these would be scale patterns. One finds, 
, i » that 18.5% of the individuals would by chance exactly fit a 
x ie Pattern and that 42.3% of the individuals would by chance be only 
Ss sy pattern. In fact, one finds that by chance one would expect 
85% o % reproducibility. Again it is apparent that the criterion of 
oa r 90% reproducibility would be a very poor one to apply for 
mining the presence of uni-dimensionality. 
| 
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It is clear that applying a criterion like 85% or 90% reproducibility 
to all attempts at scaling, irrespective of the number of items involved 
or the number of possible answers to each item, leads to false conclu- 
sions. In one case where there are many items and many parts to each 
question, 85% reproducibility might be excellent consistency; in an- 
other case 85% reproducibility might represent no better than chance 


occurrence and be no evidence at all for uni-dimensionality. Any - 


criterion for deciding whether or not there is uni-dimensionality must 
rest upon calculations based on the number of questions, the number 
of parts to each question, and the percentages of the sample in each 
part to each question. 

Let us take an example of an actual scale (10, see Fig. 2). Seve? 
items were used. The first item had three possible responses, the second 
two possible responses, the third had four possible responses, the fourth 
and fifth items had three possible responses each, the sixth item h@ 

_four possible responses and the seventh item had three possible re- 
sponses. There are 16 scale patterns. The total number of possible 
patterns is 2,592. The author states that over one-fourth of the me” 
tested showed perfect scale patterns. He does not present the rept” 
ducibility for the scale. Let us see what evaluation we can make of the 
evidence for or against uni-dimensionality. One can calculate that by 
chance 7% of the cases would fall into perfect scale types. If we compare 
this theoretical 7% with the obtained approximate 25% one wou 
certainly come to the conclusion that more people fell into perfect scale 
types than one would expect by chance in a sample of 2,000 cases- Toa 
major question is, however, whether or not this is evidence for WU 
dimensionality. Let us assume that there were two dimensions operat- 
ing. One could still obtain a greater proportion of perfect scale type 
than one would expect by chance. 

Suppose we attack the problem from the other end. Suppose 
we reason that the unreliability of the responses to the questions 
is what makes for deviation from the perfect scale pattern when unt- 
dimensionality does exist. If we assume that for each of the question® 
90% of the responses are “reliable” responses, (this would be equivale” 
to about 80% agreement on test-retest) we would conclude that 47.80 
would fall into the correct scale patterns and that 52.2% would not 
so because of unreliability with uni-dimensionality present. O oi 
52.2%, 3.7% would again fall into correct patterns by chance. The tota 
percentage of individuals that we would expect to find fitting the exal 
scale patterns would be 51.5%. If this theoretical percentage is CO, 
pared with the obtained 25%, it becomes clear that, if the assumed 


SS 


QUALITATIVE DATA AND “SCALE ANALYSIS" 159 


reliability adequately represents the data, more than one dimension is 
being tapped. 

It is suggested here that the expected occurrence of scale patterns be 
calculated assuming (a) independence among questions and (b) uni- 
dimensionality with a certain degree of unreliability. The obtained 
Percentage of individuals who exactly fit the scale patterns could then 
be compared with these theoretical percentages using chi square tests. 
Estimates of the actual reliabilities of the questions used would, of 
Course, make the interpretation unequivocal. 

It is the belief of the author that except for the most simple vari- 
ables, uni-dimensionality will not be found to exist in connection with 
the measuring instruments which social scientists can construct at 
Present. It would appear futile to insist upon uni-dimensional scales or 
to make very much of distinctions between scales which possess differ- 
ent “degrees of uni-dimensionality,” such as the distinction between 
Scales and quasi-scales. 

Scale analysis still provides the investigator with a good technique 
for scale construction and a means for determining quantitatively the 
extent to which his data depart from the ideal of uni-dimensionality. 
Such knowledge should help the investigator considerably in interpret- 
Ing his data. 


GENERAL EVALUATION OF ‘SCALE ANALYSIS” 


_ Thus far the technique of “scale analysis” has been used mostly 
With the armed forces. Guttman (5), and McNemar (7) both recom- 
Mend its use in attitude testing and in public opinion research. Limited 
Xperience with its use in public opinion research with civilian popula- 
tions has tended to show that it becomes an unwieldy instrument. In 
inter viewing people, maintaining good rapport and the interest of the 
interviewee are important considerations. The process 0 i 
many as ten questions, all of which are, to a large extent, rephrasings of 

€ same thing, is a considerable strain on relations between interviewer 
and interviewee. Most of those engaged in this type of research will 
Probably find the inclusion of a series of questions which could be sub- 
Jected to scale analysis not feasible from practical considerations. In 
Connection with standard paper and pencil attitude tests, however, 
~ale analysis offers the promise of considerable improvement in our 

asuring instruments. ee 
T Many claims have been made by the proponents of “scale analysis. Í 

hese claims are based upon the mathematics of uni-dimensiona 
Measuring instrument. Examples of these claims are (4): 


f asking as 
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Scale scores provide an invariant quantification of the attributes for predict- 
ing any outside variable whatsoever. 

In imperfect scales, scale analysis picks out deviants or non-scale types for 
case studies. 

Weare assured that if a person ranks higher than another person ina sample 
of items he will rank higher in the universe of items. 


These statements may tend to give spurious impressions of the power 
of the measuring instrument and may create in the user of scale analysis 
a greater confidence in his instrument than is justified by the conditions 
of measurement. Non-scale types may occur because actually more than 
one dimension is being measured or may occur simply because of un- 


reliability of the measuring instrument. 


Even if a perfect scale were 


achieved these claims would all be limited by the degree of reliability of 


the measuring instrument, that is, 


of the questions asked. 


It might be convenient simply to remember that a uni-dimensional 
scale of.very high reliability would have about the same properties as, 
for example, the common 12-inch ruler. These properties are, of course, 
quite important, but there is no need to exaggerate them. 

If as is suspected uni-dimensional scales do not at present occur even 
to a good approximation, then these various claims can be mostly 
ignored as far as the use of the scale is concerned. 

It should be emphasized again, however, that the technique of scale 
construction and evaluation as represented by ‘‘scale analysis” seems 
to be an excellent technique for use with paper and pencil tests or other 
instances of measurement where the situation permits the inclusion of 
several questions centering about the same topic. Where used it will 
provide useful information about the degree of departure from uni- 


dimensionality. 
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MANUAL OF CHILD PSYCHOLOGY 
A SPECIAL REVIEW* 


ROGER G. BARKER 
Clark University 


The appearance of the Manual of Child Psychology was a major 
event in psychological publication in 1946. The announcement of a 


successor to the Handbook of Child Psychology, 


published in 1933 and 


long out of print, was greeted with enthusiasm, and with the hope that 


it would prove as useful and influential 


as its predecessor had been. 


Many who studied the Handbook as students opened the Manual with 


eagerness, for here was a chance to view 


as a whole the changes in 


research on child behavior of these 13 years so momentous in science 


and society. 


Some differences between the Manual and the Handbook are readily 
apparent as the following comparisons show: 


Approximate length of text 
Approximate length of bibliographies 
Number of chapters 


New chapters added to the Manual: 


Animal Infancy (Cruikshank) 

Physical Growth (Thompson) 

Environmental Influences on Mental 
Development (H. E. Jones) 

The Ontogenesis of Behavior (Gesell) 
and Maturation of Behavior (Mc- 
Graw) replace Gesell’s former 
chapter on Maturation and Pat- 
terning of Behavior 


The quantity of child 
in the Manual roughly 20 


* CARMICHAEL, LEONARD. ( 
& Sons, 1946. Pp. viiit+1068, 
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Manual Handbook 
550,000 words 400,000 words 
4,400 references 2,800 references 

19 24 

Chapters of the Handbook ommitted 

from the Manual: 


Locomotor and Visual Manual Func- 
tions in the First Two Years 
(Shirley) 

The Social Behavior of Children 
(Buehler) 

Children’s Philosophies (Piaget) 

Speech Pathology (Travis) 

Eidetic Imagery (Kluever) 

The Physiological Appetites (Blatz) 

The Child of Special Gifts and Spe- 
cial Deficiencies (Hollingworth) 

The Child with Difficulties of Adjust- 
ment (Blanchard) 

Birth Order (H. E. Jones) (Incor- 
porated into the new chapter by 
Jones) 


psychology has increased in these 13 years: 
Percent fewer topics require 35 percent more 


Ep.) Manual of child psychology. New York: John Wiley 
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Space and refer to over 50 percent more publications. The quality of 
child psychology has increased too. The Manual reports greatly im- 
proved techniques and scientific standards, and careful checking and 
rechecking of 1933 results. In comparison with the Handbook, which 
appears adolescent in retrospect, the Manual is a mature, responsible 
Production in the fields it touches. 

But a strong impression made by the Manual is one of sameness and 
familiarity. There can be no doubt about it; Child Psychology, 1946 
(according to the Manual) is surprisingly like Child Psychology, 1933 
(according to the Handbook). It is true that the old material is more 
Soundly grounded than when reported in the Handbook, and perhaps 
this is all one should expect. Yet these 13 years seemed to be momen- 
tous as they were happening. We thought new discoveries were being 
made, and it seemed that new methods, new ideas, and new attitudes 
and practices were being developed. One can picture how a similar 
book in genetics, biochemistry or atomic physics would compare with a 
1933 predecessor! We thought Child Psychology, too, was a lusty, 
rowing, rebellious youngster. So we ask, is this report true? 

Let us consider how some of the growing points of child psychology 
are treated in the Manual. Take play, for example. Play has become 
a central technique of clinical child diagnosis and therapy, it is an ac- 
cepted experimental method, and it has been the object of fruitful in- 
vestigation. The index of the Manual cites play 14 times, but reference 
to the text reveals that all of these considerations are extremely brief. 

hen we look under the names of those who have contributed to 
achievement in this area (for example, Bender, Erikson, Lerner, Levy, 
acFarland, Murphy, Sliosberg, Stone) we find a similar summary 
treatnient. Here is one of the active areas of child psychology which the 
anual treats almost incidentally. 

We find the same kind of treatment of social behavior. Social be- 

Avior has been another productive area: important techniques and 
Concepts have been introduced, and valuable studies made (for ex- 
ample, H. H. Anderson, Criswell, Doll, Horowitz, Isaacs, Jennings, Lip- 
Pitt, Moreno, Parten, Partridge, Redl, Slavson, Tryon, M. E. Wright, 

eleny), But a student would certainly gain the impression from the 
anual that social behavior is a moribund aspect of modern child psy- 
chology, 

Child personality also receives scant treatment. Although Lewin’s 
Work is well represented, the contributions of Blos, Davis, Frenkel- 

runswik, Anna Freud, H. E. Jones, MacFarlane, Murphy, Sanford, 

Ymonds, Zachry, for example, are virtually ignored. Personality and 
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behavior aberrations in children are hardly mentioned. Sigmund 
Freud is named 15 times, while fifty other contributors are mentioned 
more frequently. 

The interrelations between physique and behavior in children (phys- 
iological, psychosomatic, and constitutional) have received rewarding 
attention in recent years, yet they are given only fragmentary considera- 
tion in the Manual. 

Recent methodological advances have been made in connection with 
multi-discipline and longitudinal studies, projective tests, play, inter- 
view procedures, sociometric techniques, experimental studies of com- 
plex behavior and the factor analysis of data. Save for experimental 
studies, however, although all are mentioned, they fail to stand out 
from the other methods used with more or less success in the last 40 
years, Since the science of any period is a product of its current meth- 
ods, it is especially unfortunate for the young scholars who may use the 
Manual that modern methodology is not given more prominence. 

In the case of theory of child behavior, we find ourselves again con- 
fronted with lacunae. One gains the impression that child psychology 
during the last 13 years has not been concerned with theoretical prob- 
lems, and that there have been few serious differences in viewpoint. 
Aside from Lewin’s chapter, theory is seldom mentioned. . The reader 
would have to be intuitive to appreciate that a series of battles of view- 
points have been raging across the child psychology front from the 
school room to the laboratory (differentiation vs. integration, whole vs. 
part, Gestalt vs. association, discipline vs. freedom, nature vs. nurture, 
pure case vs. statistical average, directed vs. non-directed therapy, PSY~ 
choanalysis vs. all comers, training vs. gratification, test vs. clinical im- 
pression). Clinical contributions to the theory of children’s benavior 
are not included. 

We have to conclude that the Manual gives a conservative, almost 
old-fashioned picture of child psychology by omitting or inadequately 
reporting some of the most important developments. A careful exami- 
nation of the text shows that some of these developments are mentioned, 
but that they are not placed in perspective. Currently important prob- 
lems and procedures do not stand out; issues are not clearly stated; 
results are not strictly evaluated. The Manual gives the impression of 
a catalogue, rather than a narrative. 

However, these imperfections should not be over-emphasized. The 
excellences of the volume should be equally stressed. To some, the 
general conservatism of the Manual will be considered its chief virtue- 
No book can serve all functions or satisfy all tastes. In fact the effort 
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to do this appears to be one source of the Manual’ s deficiencies. The 1933 
Handbook could barely accommodate in an omnibus volume the bulk of 
the material then available and retain some of its variety and individu- 
ality; but Child Psychology, 1946, is too extensive to be packed between 
the covers of this kind of a book without such fractionation and com- 
pression that its significance is greatly reduced and its character largely 
destroyed. 

The Manual performs in some degree the functions of a number of 
specialized publications, the functions, namely, of a reference work, a 
critical research survey, a yearbook, a bibliography and series of abstracts, 
and an advanced text. According to the preface, the Manual is “an 
advanced-level textbook,” but internal evidence indicates that a num- 
ber of the contributors had these other kinds of publications in mind to 
various degrees as well, publications with characteristics that seriously 
conflict when the attempt is made to include them allin a single volume. 
Thus, the vast detail which is a virtue in a reference work interferes 
Seriously with the book’s usefulness as a text. As a text, the Manual 
requires introductory material which spoils its effectiveness as a critical 
Tesearch survey for mature scholars. The requirements of completeness 
of coverage and non-evaluation in a bibliography and abstract are at 
Odds with the need for selection, evaluation and organization in a text- 

ook or a research survey. In a book for research scholars, emphasis 
upon undeveloped but potentially fruitful problems, and unverified but 
Suggestive methods, is a virtue but in a text these are liabilities. The 
inevitable slowness of producing a book of this size and with this number 
of contributors seriously interferes with the possibility of reporting cur- 
Tent developments. : 

Ar additional handicap with which the Manual had to contend was 
Wartime restrictions on publication. Because of this its release was ap- 
Parently greatly delayed; consequently there are not many citations of 
literature dated later than 1942. For this reason, its immediate revision 
is desirable, When this is done, it is to be hoped that the Manual’s 
Special function will be unambiguously defined. We are at the point in 
Psychology where reference works, research surveys, yearbooks, and ad- 
Vanced texts are badly needed, but it is clear from the Manual that we 
re beyond the stage when a single volume can accommodate such di- 
Verse materials. These are tasks which some divisions of the American 

Sychological Association might well undertake for their specialties. 

The order of the chapters in the Manual and brief comments upon 


them follow: 
Methods of Child Psychology (John E. Anderson) is an expanded and 
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greatly improved version of the same chapter in the Handbook. Espe- 
cially welcome are discussions of sampling problems, design of experi- 
ments, actuarial and individual prediction, genotype and phenotype, 
and statistical and psychological significance. However, this chapter 
suffers much from the limitations of this kind of publication. Under the 
necessity of touching upon everything from simple to complex and from 
new to old, the chapter becomes a conducted tour where much is viewed 
but where nothing can be examined. 

The Onset and Early Development of Behavior (Leonard Carmichael) 
is an expansion of the chapter by the same author in the Handbook 
entitled Origin and Prenatal Growth of Behavior. The previous article 
has become a classic and the present one continues on this same high 
level. However, one wishes for a book on a subject of this scope; in the 
present article there are an average of 140 words of comment per re- 
search cited. This has been a relatively active field in recent years. The 
chapter is 20 percent longer and has 43 percent more references than 
the earlier one; 40 percent of the 500 references are dated 1933 or later. 
In view of these changes, it is disappointing to find that the issues re- 
main almost exactly where they were in 1933. The author comes to the 
identical conclusion in 1946 regarding the central issue that he did in 
1933, namely “in regard to the related processes of individuation and 
differentiation of behavior it seems that as yet, at any rate, it is better 
to record as unambiguously as possible the responses that can be made 
by a fetus at any stage rather than to attempt to fit all developmental 
change into one formula.” 

Animal Infancy (Ruth M. Cruikshank) is a valuable addition to the 
volume. Here is a summary of research in a field that has been tilled 
little, but which has important potentialities. It is in the finest and 
most valuable tradition of scholarly research reviews to define and sur- 
vey such frontier areas. 

The Neonate (Karl C. Pratt) is an extensive revision and expansion 
of the same chapter in the Handbook. The present contribution is al- 
most twice as long as the original article; 45 percent of the 350 refer- 
er-ces are dated 1933 or later. While the 1933 review presented a frame- 
work, here is the completed structure. On the descriptive level, it 
appears that the neonate’s behavior has been fairly well mapped. It 
is to be hoped that the frontiers of research will soon be pushed to mat- 
ters of causation, and from the more physiological to the more psycho- 
logical problems. The work of Ribble (who is not mentioned in the 
Manual), for example, opens important problems deserving careful in- 
vestigation. This is another research survey which at times approaches 
an annotated bibliography (100 words of comment per reference). 

Physical Growth (Helen Thompson) is a new contribution, To the 
reviewer's mind, this chapter is out of place. First, as the author states, 
the material is so vast it cannot possibly be covered adequately. Second, 
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the crucial problem for psychologists in this connection is the nature of 
the relation between physique and behavior in childhood, and it is to 
this that a chapter should be directed. Although this relation has not 
yet been extensively investigated, it would seem that we have arrived 
at the point where a short chapter after the manner of Animal Infancy 
1s possible. 

The Ontogenesis of Infant Behavior (Arnold Gesell) is written to quite 
a different pattern from the other chapters. It is a stimulating essay 
Presenting an interpretation of developmental data in the form of some 
descriptive generalizations. This essay will be of interest to mature 
Scholars who will argue long and loudly whether these ingenious gen- 
eralizations are true and whether they are productive of further research 
and understanding. 

Maturation of Behavior (Myrtle McGraw) covers the maturation vs. 
earning problem. Few questions in psychology have had such a turgid 
course, and here is another competent effort to clarify the issues. 

espite the author’s protestations of the falsity of the dichotomy, she 

nds it necessary to wrestle at length with a systematic statement of the 
relation between learning and maturation. It may be that this was in- 
evitable when the task of writing a chapter with this title was accepted, 
for it implies that a general solution is possible, which then has to be 
laboriously disavowed. Perhaps in the next Manual the author’s con- 
clusion will be heeded and we can have chapters on the Effects of Train- 
ng upon Early Behavior Development and Neural and Physiological Cor- 
relates of Behavior Development in Children, and save ourselves the pain 
of struggling with the monster of interdependency again. 

_ Learning in Children (Norman L. Munn) is almost twice as exten- 
Sive as the 1933 chapter by Joseph Peterson. Of the 340 references, 36 
Percent are dated 1933 or later. In the face of great difficulty, due to 
the chaotic state of learning studies, the scope of the problem, and the 
lack of studies directly comparing the learning of children and adults, the 
author has reviewed a great number of studies of conditioning, acquisi- 
tion of motor skills, memorizing and problem solving which use children 
as subjects. He concludes that beyond the very early childhood years 
when the handicaps of neuromusculature immaturity have been largely 
Overcome, practically all differences in the learning of children and 
adults may be attributed to differences in motivation and in previous 
experience., 

The Measurement of Mental Growth in Childhood (Florence L. Good- 
€nough) is relatively little changed from the Handbook version. Of 109 
Teferences 34 percent are dated 1933 or later. Additions include the 
Work of Kelley, Thurstone, Thompson, and McNemar on the factor 
analysis of intelligence test scores, and Bayley on mental growth. This 
mpresses one as being written in the best tradition of an advanced text. 


Language Development in Children (Dorothea McCarthy) is at the 
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same time one of the most active and hopeful, yet disappointing, aspects _ 
of child psychology reported in the Manual. The present chapter is over 
three times the length of the Handbook chapter. Of the 480 references, 
40 percent are dated 1933 or later. Here is an active field that touches | 
the basic concerns of child psychology: emotion, personality, intelli- \ 
gence, social development, behavior aberrations, education, neurology- 
However, few of these larger aspects of language have been subjected 

to investigation. The problems have been defined within a narrow, 
technical frame, so that language has been largely divorced from its 
significance as behavior. 

Environmental Influences on Mental Development (Harold E. Jones) 
is a new contribution to the Manual. It is packed with information. 4 
Here is an adequate reflection of a preoccupation of the times; sixty-tw° 
percent of the 235 references are dated 1933 or later. Of all the chapters 

_ in the Manual this one probably suffers most from the difficulties im- 
posed by the volume. Under the necessity of covering material ranging 
from I.Q. constancy to diagnosis of identity in twins, and of writing on 
all levels from pointing out the possibility that coaching affects test 
scores to discussing the phenomenon of statistical regression to the 
mean, it has been difficult to present a sharp picture of the issues in- 
volved. The chapter will be of great value as a review and commentary 
for students who bring considerable technical background information, 
and to those who are able to continue their study in the literature cited. 

The Adolescent (Wayne Dennis) meets the issue of the Manual’s 
ambiguity of function by sharply limiting the scope of material covered. 
The psychology of adolescence is restricted to the effects of biological 
adolescence upon behavior. This makes it possible in the space available 
to give an adequate review of the research upon this problem. & 

Research on Primitive Children (Margaret Mead) is a paper for re- 
search scientists. It is a stimulating discussion of the methodology 9° 
personality studies in primitive cultures, and one that is almost equally 
applicable to field studies of children within our own society. This" 
chapter provides a sample of the value of a volume prepared for scholars 
with no concessions to immature students. A unique and valuable de- 
parture is the inclusion in the bibliography of citations of research in 
progress. 

Character Development in Children—An Objective Approach (Vernon 
Jones) is a revision of the chapter by the same author in the Handbook 
entitled Children’s Morals. This has been a relatively active field; 55 
percent of the 140 references are dated 1933 or later. Perhaps because 
of the nature of the material, this chapter combines better than most 
the textbook and research-survey functions. 

Emotional Development (Arthur T. Jersild) is a systematic review of 
objective studies of emotion in children; 51 percent of the 238 citations 
are 1933 or later. It appears that the frontiers of this most wild an 
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difficult region of child psychology are slowly being pushed back. The 
Picture of healthy progress here given is probably an underestimate by 
reason of two limitations under which the author has worked: 1) the 
exclusion of all studies not reaching completely to acceptable standards 
of scientific method and 2) the separation of emotion from other aspects 
of the behavior of children. It is obvious that some of the potentially 
Most fruitful developments of recent years are as yet outside the bounds 
of acceptable science and have had to be omitted. The fractionation of 
the child and the separate consideration of intellect, language, charac- 
ter, learning, emotion, etc. probably is most limiting and misleading 
In the case of emotion. 

Behavior and Development as a Function of the Total Situation (Kurt 
Lewin) is the only chapter in the Manual written within a definite con- 
ceptual frame. However, it is by no means a speculative discussion. Al- 
most all of the 156 references refer to experimental work; 79 percent of 
the references are dated 1933 or later. This is the most complete state- 
Ment of Lewin’s position available at the present time. Like so much 
p Lewin’s writing, this paper suffers at points from extreme concise- 

ess, 

The Feeble-Minded Child (Edgar A. Doll) extends the treatment of 
the same subject by Pintner in the 1933 Handbook. The material has 

cen organized to emphasize standpoints rather than to survey the de- 
tailed literature. The topics covered are definition, classification, in- 
Cidence, characteristics and causation. Forty-six percent of the 164 
Teferences are dated 1933 or later. 

Gifted Children (Catherine Cox Miles) is in the pattern of the chapter 
of the same title by Terman and Burks in the Handbook. The framework 
there erected has been filled in with remarkably little alteration. Valu- 
able new material on the later history of Terman’s subjects is reviewed. 

ere are 330 references; 42 percent of them are dated 1933 or later. 

Psychological Sex Differences (Lewis M. Terman and Associates) is a 
thorough, critical review of the widely dispersed data bearing upon this 
topic, It will be an important starting point for a long time to come for 
Tesearch scholars concerned with sex differences. 


The Manual of Child Psychology impresses one with the number of 
Taw facts that have been reliably observed and recorded and with the 
Number of interrelations that have been described. This very wealth of 

ata is becoming a burden. In the absence of a strong framework of 
E cory to organize and subsume such an array of facts, and to guide 
urther observation and experimentation, we are in danger of being en- 
gulf ed. As a theoretical structure is developed we can expect the defi- 
ition of child psychology to change from one based on chronology (that 
Part of general psychology which uses chronologically young individuals 
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as subjects) to one based upon psychological criteria. When this occurs, 
child psychology will become less disorganized and less encyclopedic and 
the task undertaken by the Manual will be less formidable. This is the 
basic difficulty confronting the Manual: lack of concepts and hy- 
potheses appropriate to the empirical content of child psychology. As a 
compendium of a great amount of this content, and asa publication base 
in terms of which more specialized books may be produced, the Manual 
will have a great and permanent value. 


RESPONSE TO CRESPI’S REJOINDER AND 
CONRAD’S REPLY TO APPRAISAL OF 
OPINION-ATTITUDE METHODOLOGY 


QUINN McNEMAR 
Stanford University 


The rejoinder of Crespi (4) and the reply of Conrad (2) to our ap- 
Praisal of opinion-attitude methodology (9) necessitate a published re- 
sponse to a few points, particularly those having to do with facts. We 
will first consider three of the points raised by Crespi. 


1. We acknowledge a serious error in reporting Cantril’s material on 
the reliability of the polling procedure. Though we correctly reported 
the only figure for the reliability of opinion questions, we did confuse 
the data regarding factual-type questions. 

2. It is said (4, p. 563) that we “overlooked” at least three articles 
on reliability, and that if we “had seen” these we “would have apprect- 
ated the injustice” of our remark that in opinion research individual 
“reliability has been practically ignored by the users of the single ques- 
tion technique” (9, p. 314; our present italics). We did, indeed, fail to 
report on the three cited papers, but not because they were unseen. We 
Concur with Crespi when he says, “Interested readers are invited to 
check the facts here for themselves” (4, p. 563). What are those facts, 
and are they pertinent? Jenkins’ paper (7) is concerned solely with ques- 
tions of the type “What brand did you last purchase?,” not with opinion 
questions. Section D of Dodd’s report (5) contains empirical findings on 
Teliability—of questions pertaining to radio listening habits—with no 
Specification as between factual and opinion questions. Now it happens 
that we had also seen the 148 page, security-marked “confidential, 
document referred to by Dodd (5, p. 266). This does contain informa- 
tion on the reliability of opinion type questions, but unfortunately the 

ata are not analyzed and presented in a manner which makes it pos- 
sible to ascertain how reliable the questions were. King’s article (8) deals 
With 15 questions which are “Gdea-centered,” hence atypical as regards 
Ordinary single question procedures. Furthermore, detail is lacking for 
Svaluating his report that the agreement between results secured from 
a cases by two interviewers, with a two week lapse between interviews, 
anged from 60% to 100%. ae... 

3. Crespi says (4, p. 567) that our handling of validity is “plain 
contradiction,” which results in “Three errors—an error of fact, an error 

Ogic, and an error of unjustified condemnation.” In proving all this, 

€ quotes from p. 315 of our appraisal. At the risk of tiring the reader, 
G Present the quotation along with parts (here bracketed) omitted by 
espi: 
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One might have expected that Cantril’s volume (1) would have tackled the 
problem of opinion question validity but such is not the case. [Evidently “the 
serious problems encountered in every phase of the polling operation” (1, P 
viii) do not include what the writer considers to be foremost in importance, 
namely, validity and reliability for the individual response.] The volume does 
touch on some of the factors which contribute to [unreliability and] invalidity 
[but a frontal attack is sorely needed]. 


In what way is any of the above “‘an error of fact”? We urge the reader 
to search Cantril’s volume for a validity study. Then having foun 

none, he will see that the “contradiction” and the “error of logic” con- 
sist merely in having reported, in a credit line, that some of Cantril’s 
material has a bearing on the problem of validity. Crespi correctly gives 
the location of this material when he states that “the entire point © 
chapter 1 and, in part, chapter 2 of Cantril’s volume is the validity of 
questions” (4, p. 567). Actually, chapter 1 on the “Meaning of Ques- 
tions” contains no mention of validity, and in chapter 2 on the “Word- 
ing of Questions” validity is mentioned only in the discussion of the 
“absence of any objective criteria of validity” (1, p. 23). 


We now turn to five specific points in Conrad’s reply and then wẹ 
shall discuss briefly a method used in his evaluation. 


1. Regarding the question of item selection, i.e., the aposteriori 
singling out of particular items for study, we too are “‘at a loss to under- 
stand” (2, p. 581) how we overlooked an entire paragraph which not 
only definitely negates our statement that no reason was given for 
choosing items, but also supplies information in line with our surmise 
about possible capitalization on chance. We hereby tender our apology 
to Drs. Sanford and Conrad for this oversight. 

2. It is evident that Conrad does not (and need not) agree with us 07 
issues having to do with scales, uni-dimensionality, and reliability, but 
when he defends his scales by claiming that the average inter-item cor- 
relations of .08, .12 and .07 for his war-optimism scales are ‘‘as high aS 
[those] among items of acceptable intelligence tests” (2, p. 572, with re- 
peats of the same idea on pp. 575 and 586), we wonder what empirical 
data on acceptable intelligence tests he had in mind. He states, without 
giving a source, that “‘intelligence-test items ordinarily correlate with 
the total test only about .30-.50” (2, p. 572). Now to some figures. The 
correlations for items vs. total score for the 1937 Stanford-Binet Revi- 
sion (10, pp. 176-185) range from .27 to .91, with 189 of 243 coefficients 
at or above Conrad’s upper value of .50. The mean of these 243 correla- 
tions is .61. In the case of the Terman-McNemar Test of Mental Abil- 
ity the average item-total score correlation is .53 for r’s based on 1200 
subjects (13, p. 2). As to inter-item correlations, the average value for 
the 1937 Stanford-Binet Revision (10, p. 156) is about .38, and from thé 
above mean of .53 it can be inferred by use of Richardson’s Formula 7 
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(11, p. 70) that the average intercorrelation among the 324 items of the 
erman-McNemar test is near .28. i 
x One other point on uni-dimensionality: we agree with Conrad's 
p sument (2, pp. 572-573) that for prediction purposes it is permissible 
ot, ombine an index anything that improves prediction, but we were 
< iscussing the problem of prediction. We had questioned the mean- 
h ee total scores based upon items of near zero intercorrelation and 
aa criticized as meaningless an index of “life satisfaction” based on 
oe uncorrelated components. Since no outside criteria were available, 
C e problem in both these cases was that of scaling not prediction; hence 
Tht has confused the issue by bringing in an irrelevant argument. 
k ae reader may refer to Guttman (6, p- 148) for a clear-cut statement of 
€ necessity of keeping in mind the distinction between these two 
Osely related topics. 
as Conrad thinks we are “seriously in error” (2, p. 577) in stating 
obt When an item has a reliability of .25 it can be said that 75% of the 
ained response variance is due to measurement errors. Now if we 
understand Conrad’s argument against this well-known statistical fact, 
a is defining an individual’s current attitude in terms of the obtained 
nee as though the response were not subject to error. He says 
it at “Current attitudes, thus measured, can be denied validity only if 
be assumed that item-responses are indeed evanescent” (2, p. 577). 
ae need not assume evanescence if Conrad’s reliability estimate of .25 
correct—such low reliability conclusively proves the assumption. 
te 4. Regarding part-whole correlation, Conrad says that “McNemar 
whrstaken” (2, p. 578). This is proven by a mathematical derivation 
i a begins with the supposition that “if the set of 10 items had a 
ws her standard deviation than the set of 14.” This supposition 1s con- 
Scale to the facts: the 10-item scale had an S.D. of 4.1 and the 14-item 
: e had an S.D. of 5.9 (3, p. 293). Consequently, if our original con- 
pak needed an algebraic proof it has now been supplied by Conrad. 
ply. Conrad (2, pp. 581-583) counters our criticism of his failure to ap- 
by qt tests by saying that good hypotheses can easily be buried 
fail t e mechanical application of significance tests. We agree, but we 
Peci “4 see how that argument justifies ignoring entirely such tests, es- 
See y when doing so leads to the conclusion, concerning ‘one of the 
tom discriminating questions,” that a relationship exists which (when 
n KAk by us) turns out to be a mere correlation of .06 for 88 cases. 
S eee: defense, Conrad states that “The reader will recognize the 
ae ‘hypothesis-hunting’ nature of the remarks” which describe 
hun. ly the procedure which he and Sanford used. This “hypothesis- 
ting” consists of beginning with the hypothesis that two variables 
(from which it ~ 
ot borne out), 
e for 


in, correlated, collecting data, making a scatter diagram 
ae d be immediately obvious that the hypothesis is n 
examining the extremes and finding that the median Y-valu 


174 QUINN MCNEMAR 


the high and low X-groups “are the same,” then looking at the extremes 
again and noting that the frequencies in the corners of the scattergram 
are actually 3 and 0 for the high as opposed to 1 and 2 for the low group, 
and finally basing the conclusion on 6 of the original 88 cases. Any 
reader who thinks the foregoing an exaggeration is referred to the 
printed report (12, pp. 13-14). 

6. Aside from the above five factual-type points, we would like to 
mention a few factors that tend to make Conrad’s reply sound like & 
convincing rebuttal to our appraisal. Repeated reading of his discus- 
sion along with checking back to our original paper led to some inter- 
esting observations. We noted the frequent occurrence of the debater’s 
“my opponent himself has said,” along with the stratagem of giving 
partial quotations and/or ignoring the context from which quotations 
were lifted. It is not for us to say whether the use of such forensic de- 
vices should be discouraged in scientific writing, but in the present con- 
troversy it is not only our right but also our duty to point out specific 
instances which are apt to mislead the reader who does not have time 
to do detective work. 

a. In connection with the problem of the comparison of dispersions 
on single items, Conrad’s discussion (2, pp. 576-578) ends with “We are 
inclined, rather, to agree with McNemar’s statement at another point 
in his review, that—,” and then he gives the following quotation without 
the part which we here bracket: “[Another advantage of scaled opinions 
is the fact that the] variation within groups indicates the relative homo- 
geneity of groups in their opinion about an issue” (9, pp. 327-328). The 
reader, who is led to believe that we had been inconsistent, woul 
never guess that the quotation had been removed from our context © 
scales to bolster an argument in a context of single items. 

b. Another out of context quotation, from which it would appeat 
that at one place we sanctioned that which we disallowed at anothe™ 
may be found on p. 575 of the reply. Our criticism of the assigning ° 
absoluteness to the so-called neutral point of a scale is countered wit” 
“As McNemar himself has said,” followed by a quotation from a sectio” 
in which we were arguing against, not for, such usage of scales. 

c. At the top of p. 575 a quotation purporting to show our rejection 
of a certain dictum concerning requisite reliability is followed by an- 
other quotation, which is characterized as unimpressive rhetoric an 
said to be the only reason which we offered for our view. In this cas€ 
the first quotation stops short of a part so relevant that had the sentenc® 
been given in full the reader would have grasped our real reason for 
questioning the dictum. ya bay 

d. On ppË573-574 a quotation and subsequent discussion would lead 
one to think that we recognized “temporarily” a situation for which ou! 
tolerance 61 pages later “seems to have vanished,” but the quotatio® 
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did not endorse the thing (summing scores on components) being dis- 
cussed at the later place. 

e. On p. 579, “McNemar’s own criterion” for a scale is cited as sup- 
Porting an argument; reference to the original (9, p. 294) indicates that 
i a not speaking of criteria, but of what attitude measurement per- 

s. 

f. The failure to consider statistical significance in checking hypoth- 
eses (see point 5 above) is supposedly supported in part by a “McNemar 
himself has said” quotation having to do with the dangers involved in 


using small samples. We never suggested that these dangers in signifi- 


cance tests could be surmounted by the simple expedient of ignoring 
the tests, 

g. On p. 580, “McNemar has himself objected to” is followed by a 
quotation taken from a paragraph in which we thought we were merely 
isting some conclusions, not objections. 

h. On p. 585, Conrad says “The following quotation illustrates the 
unsympathetic ‘slanting’ of McNemar’s account.” Then a passage from 
One of several closely related paragraphs (9, pp. 334-336) is given. By 
Starting and stopping at the right places, the illustration possesses a 
glimmer of plausibility, and from the subsequent discussion of the pas- 
Sage the reader is supposed to see that “McNemar is wrong when he 
implies that Cantril has erred.” Before accepting either of Conrad’s 
Verdicts, the reader will do well to turn to the context to see what was 
Not quoted, 


We have not herein dealt with all the things which seem to annoy 
Crespi, nor have we bothered with many points on which we disagree 
With Crespi and Conrad. The preparation of this response would have 
been greatly facilitated if it had been necessary only that we admit our 
errors, 
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Symonps, P. V. The dynamics of human adjustment. New York: Apple- 
ton-Century, 1946. Pp. xiv+666. 


Symonds regards this book as carrying on the Columbia tradition 
of dynamic psychology initiated by Woodworth and Thorndike. He 
Teadily admits, however, that his contribution is “not exactly along the 
lines of its predecessors.” Since his text is frankly and enthusiastically 
Psychoanalytic in orientation, many experimental psychologists will 
regard this as an understatement. 

Dynamic psychology is depicted as a study of the whole individual 
and of how he adjusts to the situations he confronts—how he derives 
Satisfaction of his inner drives from his environment. Process and cause 
Tather than personality status are regarded as the subject matter of the 
field. In fact, the individual as a total personality, his over-all develop- 
Ment and inner organization fall into the background in this volume as 
Processes are developed. This is shown in the treatment of the ego con- 
cept which is not discussed in one section but presented fragmentarily 
throughout the book as various other Freudian concepts are handled no- 
mothetically, 

‘Reason and intellect are dethroned as the principal factors in 
adjustment” the preface states, and “adjustment [is] primarily a mat- 
ter of the reactions to frustration and the individual’s attempt to avoid 
anxiety... , The use of reason in meeting problems of adjustment is 
reached, if at all, only in maturity as the result of high intellectual en- 
dowment and the capacity to profit by experience.” “.. . adjustmentis 
Carried on through the impulses, emotions, and by means of the mecha- 
nisms which are described herein.” It would be interesting to know the 
Extent to which members of the profession have reached this same con- 
Clusion and the effect in such cases of this view upon their theory of 

Ucation, their courses and research in psychology. 

Symonds has not been known as a psychoanalytically oriented 
Titer, Many who are acquainted with him as the author of an excellen* 
Organization of the personality measurement literature in Diagnosing 
| ersonality and Conduct will find this volume surprising. The differences 
n interests and viewpoint shown by the two contributions may reflect 
a trend in psychology as well as the capacity of an alert, scholarly 

Psychologist materially to change his approach in middle life. 
he author purposes to represent the conclusions of contemporary 
research and discussion on the problem of human adjustment. He states 
at as he combed the literature he concluded that the psychoanalytic 
contributions commanded the field. The basic theory of Freud and of 
By Psychology are presented with the refinements suggested by later 
udents including Horney, Fromm, Reik, Fenichel, Deutsch, Alexan- 
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der, Klein, and Isaacs—writers whose work is too little known to stu- 
dents of adjustment theory. The volume is not written primarily from 
a background of system and concepts of traditional psychology with 
psychoanalytic literature interwoven, but rather the converse. The 
chapters on drives, frustration and conflict allude to papers by psychol- 
ogists. Many of the other chapters such as “Love and Self-Love, 
“Guilt and Self-Punishment” and ‘Miscellaneous Mechanisms” con- 
tain very few contributions from members’ of the American Psycholog! 
cal Association. On the whole the experimental studies on personality 
mechanisms seem ineffectively presented. 

In his preface Symonds anticipates criticism on the score that state- 
ments are made without apparent evidence, which makes the book 
seem dogmatic. He emphasizes in rebuttal that when one makes an in- 
timate and detailed study of human life through clinical methods, it 18 
done under conditions by which it is not possible to duplicate data. 
Moreover, he contends, “ . . . when relationships are verified by repeti- 
tion of scores of investigators, relationships between observed behaviot 
and the dynamic processes which motivate it gain credence.” This 
presentation may seem unnecessarily weak to the many students who 
come through introductory courses which emphasize objective methods 
and attitudes, in which experimentsare cited for each generalization ma es 
Too, the students from nonscientific curricula who will use this tex 
need a cautious approach to broad clinical generalizations, which this 
text lacks. It is regrettable that Symonds did not take a more critica 
attitude toward some of the findings as he presents them, and at least 
acquaint the reader with the methods of obtaining these data and wit? 
well-known attacks on Freudian theory and technique. Furthermor® 
the questions of relative reliability and validity of data might have bec? 
raised. The student is at a loss to distinguish fact from theory anc 
hypothesis. Too seldom is he told that additional studies relevant to @ 
given concept are sorely needed. 

Statements such as this will arouse criticism: “ .. . there is als? 7 
part of the ego that says, ‘Do’ and ‘Don’t’ that is also unconsciou® 
(237). And many will want to know the degree of validity of assertion® 
such as: “‘[early frustration in oral experience produces] mouthy folks— 
verbose in expression, persons who like to pull on a big cigar or heav 
pipe, or chew on the stub of a pencil.” al 

The mixed style and irregular organization within chapters wil 
irritate many of the more perfectionistic readers. The integrated pê 4 
tern of psychoanalytic insights tends to be lost as Symonds attempt? 
to organize mechanically the literature in terms of single mechanisms: 

Symonds deserves great credit for his efforts toward the integr atio 
of a body of widely varied literature, for his attention to truly signi 
cant, complex and somewhat inscrutable problems, for his wholeso™® 
psychological treatment of social problems arising from individu? 
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maladjustment, and for his over-all organization of the major concepts 
of dynamic psychology. This respect is enhanced when his volume 1s 
compared with some other publications in the field. The 63 pages de- 
voted to notes concerning the 883 references are a contribution in their 
own right. This work should give pause to the personnel worker who 
dispenses superficial advice and recommends social maneuvers to allevi- 
ate deep conflicts. 
Frep McKINNEY. 


University of Missouri. 


BENEDEK, THERESE. Insight and personality adjustment: a study of the 
liad effects of war. New York: Ronald Press, 1946. Pp. xii+ 


As a psychoanalyst of the staff of the Chicago Institute of Psycho- 
analysis, Dr. Benedek is probably fairly orthodox. As the writer of a 
ook, practical and timely, and geared to the immediate clinical prob- 
ems of today, Dr. Benedek seems, to the reviewer at least, to differ 
Sharply from most orthodox psychoanalysts. With brief acknowledg- 
Ment to the theoretical substructure of psychoanalysis, the author ad- 
Tesses herself promptly to that large body of personalities today who 
Teflect the tremendous need for personal, postwar readjustment. The 
peteran is given careful consideration, not as the species veteran, but as 
he is a husband or a father or a son. What he has in common with other 
veterans—the war itself—is evaluated psychodynamically as an homog- 
€nizing influence. Just as carefully does the author evaluate the war 
as an experience for the wife and the mother of the veteran, the father 
and the child, the brother and the sister. 
b To infer from this that the author is focussed on personalities would 
€ unfortunate, for her concern seems to be primarily with situations: 
Separation, continued separation, reunion, personal readjustment. 
hese situations and the behavior they demand are presented each as a 
Psychodynamic equation. z 
š To the persons involved with it in various ways, the war experience 
erved or failed to serve needs, and an analysis of it in these terms is 2 
€althy exercise for those who today must consider problems of readjust- 
Ment. Dr, Benedek states her hope that this book will be found useful 
Y social workers, clergymen, teachers, counselors, as well as psycholo- 
hai That many such readers will be stimulated by her book I have no 
Oubt, for I liked it. It is clear that Dr. Benedek has thought her way 
Tough the present world crisis as it is involved in clinical work. Much 
aare than that, she reveals an insight into present-day problems of per- 
Onal adjustment which must derive from more than psychodynamic 
cory and thinking alone. She has seen real people trying desperately 
© meet’ today’s problems. 
Just how this book will be used by counselors, it is difficult to guess; 
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interpersonal relationship involved. Whether or not the book will serve 
as such a reference work, the reading of it as a whole will provide an 
intelligent psychodynamic analysis of the present world crisis as it 
troubles the individual today. As a book about individuals this is 4 
book in clinical psychology, and I think it has, from this viewpoint, a? 
unfortunate title. 
Tuomas W. RICHARDS. 
Fels Research Institute. 


Hewitt, L. E., & JENKINS, R. L. Fundamental patterns of maladjust 
ment; the dynamics of their origin. Institute for Child Guidance 
State of Illinois, 1946. Pp. 110. 


From the files of the Michigan Child Guidance Institute 500 cas? 
records were selected and certain information in these records enterè 
on a code schedule. ‘Three hypothetical behavior syndromes were sug- 
gested including the following items: (1) assaultive tendencies, temper 
displays ...etc.; (2) gang activities, . . . stealing, . . . truancy, ete! 
(3) sensitiveness, seclusiveness, shyness, etc.” These were formalized 4° 
(a) unsocialized aggressive behavior, (b) socialized delinquency be- 
havior, and (c) over-inhibited behavior syndrome. 

A very large number of tetrachoric and biserial correlation coefficients 
were obtained between the items and combinations of items. The mass 
of the statistical analysis is presented in a fashion which the reviewer 
found confusing. He was unable to see the meaningful forest because ° 
the crowded numerical trees. It is his impression that most of the tra 
(items) which intercorrelate highly, for example, .67 between seclusiV@ 
ness and shyness, are self-evident. No clear line of evidence was appar 
ent indicating that other than obvious relationships were found. The 
Ence may be there, but if so it is obscured by too many statistic? 
tables. 

Sample case histories are given to illustrate the personality develop” 
ment of the three syndromes. Dr. Jenkins adds a chapter giving “, 
psychiatric interpretation of this study, together with its implications 1 
guidance and treatment. l 

The authors concluded that, 


This study has served to provide a statistical verification of certain prev 
ously established hypotheses ... concerning the nature and backgrounds 
the aggressively delinquent and the pseudo-social delinquent. The situation® 
correlates of the third type of behavior (overinhibited behavior) have been re 
vealed through empirical analysis. In each of the three behavior-situation P@™” 
tern relationships there appears to be some evidence that not only is the beh ff 
ior in question provoked by a particular type of frustration but the general pat 
tern of behavior itself is exemplified by other persons with whom the child is? 


E 
for it is in a sense a compendium of dynamics, codified according to the 


| 
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close, contact. Thus the resulting type of maladjustment would appear to be 
a rational reaction of the child to his distorted environment in a double fashion. 


C. LANDIS. 
New York Psychiatric Institute 
and Columbia University. 


THORPE, Lours P. Child psychology and development. New York: 
Ronald Press, 1946. Pp. xxvi+781. 


This book is intended as a text for courses in education and psychol- 
ogy. Many instructors, however, will find it more useful as a reference 
source for their own use, and that of their more advanced students. The 
book is long, repetitious, and at times the writing is not clear. In some 
instances the selection of research materials, and in many cases their 
interpretation, is extremely biased. For these reasons it does not meet 
the needs of the average undergraduate. 

Each chapter begins with an historical and theoretical discussion of 
the topic to be presented. There follows a discussion of findings, includ- 
ing a series of presumedly typical investigations and a “summary and 
implications” in support of Thorpe’s own point of view. The book is 
heavily documented through the use of many long quotations from the 
educational and psychological literature and numerous footnotes. 
Questions for study and a list of references are appended to each chap- 

er, k 

Chapters two, three, and seven, What the Child Inherits, Mental 
Abilities: Nature and Nurture, and Intelligence and How It Develops, 
deal with the operation of heredity and environment in the process of 
child development. Thorpe obviously has strong environmental lean- 
Ings and is, of course, entitled to his point of view. However, this re- 
viewer questions the use of such terms as “congenital inheritance” 
(pp. 71-72) and “experiential maturation” (p. 144). Few psycholo- 
gists today would deny the mutual interdependence of hereditary and 
environmental influences from the moment of conception. But it seems 
unnecessarily confusing to use “inherited” for such environmentally 
Produced effects upon the child while in utero as those caused by mal- 
nutrition, disease, or injury to the mother, even with “congenital” as a 
qualifying adjective. Nor does it seem justifiable to nullify the term 

maturation” which by definition means that part of the developmental 
Process which is impelled from within, by preceding it with a word 
descriptive of extrinsic factors. Thorpe draws a clear-cut distinction be- 
tween physical attributes which he concedes to be inherited, and all 
Mental and personality traits which in his opinion have neither struc- 
tural nor innate bases. 

Although both sides of the nature-nuture controversy are presented, 
Tesearches supporting the hereditarian view are sometimes out-dated 
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where more recent and better investigations are available. The usual 
misconceptions by which the controversy is perpetuated are reiterated 
in this book: accepting test results at face value, after warning others 
not to do so, assuming that because certain conditions invalidate the use 
of tests that the function tested has changed rather than that the 
measuring instrument has been rendered useless, and finally, in lieu of 
conclusive evidence, holding the view that intelligence per se is improva- 
ble because we would like to think that it is so. It is unfortunate that 
these chapters take up so large a portion of the first part of the book. 
Many will read no further and miss some of the more acceptable mate- 
tials presented later on. 

Thorpe rejects instincts and drives in favor of basic needs as the undet- 
lying motivational forces of child behavior. The child continually strives 
to satisfy these needs, and society’s role is to see that he succeeds in @ 
socially acceptable manner. Since the author recognizes no innate per- 
sonality limitations, a suitable environment is all that is required to 
produce children free from frustration and conflict. The individual pat- 
ent is warned against ‘‘over-acceptance” of the child, but apparently 
there is no limit to the extent to which society-in-general should plan 
an environment accurately attuned to the child’s needs. 
fs Emotions are ‘‘stirred-up states of the organism,” resulting from 
failure of the child to satisfy his needs. The traditional (puritanical) 
point of view, that strong emotions are undesirable and to be avoided, 
is upheld by the author. Pleasant emotions receive scant attention. 

In keeping with Thorpe’s strong environmental bent, the best chap- 
ters in the book are those dealing with Effects of Early Home Conditions, 
The Social Education of the Child, Safe-Guarding the Child's Personality, 
and Mental Hygiene. An extensive presentation of theories, terms, an 
research findings is followed by many sound suggestions by which pat 
ents and teachers can improve the conditions under which children liv® 
and learn. 

For the mature student, capable of making his own selection from 
the wealth of material presented, the book will be a welcome addition 
tc his sources of reference. He will find useful, also, the numerous table 
summaries of research studies. 


3 r KATHARINE M. MAURER. 
University of Nebraska. 


SCHROEDER, E. M. On measurement of motor skills; an approach through 
a statistical analysis of archery scores. New York: King’s Crow”? 
Press, 1945. Pp. xvi+210. 


An interest in measurement and testing in the field of physica! 
education activities was the motive for the research reported in this 
book. Archery was selected as the subject of study on the ground that 
it is a highly routinized activity carried out under relatively well stand- 
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ardized conditions; its score is objectively measured, accurately re- 
ot and commonly accepted as a valid measure of the performer's 
ill, 

Based on data collected over a period of six years from the per- 
formances of 258 beginning, 139 intermediate, and 53 advanced archery 
students at Wellesley College, and limited in scope to the investigation 
of the behavior of the score, the study is divided into two parts. In Part 
One, which deals with the trend and reliability of the score in successive 
lessons, the purpose is to evaluate score units of stated length as tests of 
group and individual ability and improvement. Part Two concerns the 
behavior of the score within a single lesson with the object of determin- 
ing whether successive small units of score reflect the effects of practice 
and fatigue, and whether such a lesson is long enough to measure indi- 
vidual skill accurately. 

Because of the standardized nature of archery, many of the findings 
of the study—such as, for example, that the sum of the best two scores 
estimated individual skill more accurately than did the sum of the last 
two scores of the series of lessons, that the Range score on twenty-four 
shots was sufficiently reliable to be a useful test of group ability, that 
the lesson of standard length was too short to measure accurately the 
skill of the majority of subjects—will have direct application to testing 
in that field. Taken as a whole, however, the results obtained illustrate 
and emphasize a fact which is of prime importance in the interpretation 
of test scores, namely, that a score made at any stated time is likely to 
estimate ability with varying degrees of accuracy. For the light it 
throws on the nature of scores, the study should be helpful in suggesting 
Procedures to improve the accuracy of measurement in other sports 
activities. 

The methods by which the various types of problems undertaken 
were handled are presented in detail. Careful definition of terminology 
and the adequate treatment given to the statistical techniques employed 
add to the effectiveness of the presentation. As an exposition of testing 
Problems that are unquestionably fundamental, the book is an excellent 
contribution. 9 

James M. Lyncu. 

Personnel Research Section, AGO. 


Epwarps, ALLEN E. Statistical analysis for students in psychology and 
education. New York: Rinehart, 1946. Pp. xviii+360. 


This book is intended primarily for students of psychology. The 
Conventional chapters on scaling, reliability, and validity are omitted 
on the grounds that they belong in courses on tests and measurements, 
while consideration of multiple correlation is postponed to more ad- 
vanced courses, The author begins with a simple review of basic arith- 
Metical computations, such as fractions, decimals, square roots, posi- 
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tive or negative numbers, and then goes on to the treatment of meas- 
ures of central tendency and variability. A notable omission is the 
graphic treatment of distributions, and this is regrettable since some 
guidance in the graphic presentation of frequency polygons (or histo- 
grams) is an essential part of astudent’s training in elementary statistics 
if his training is to help him in subsequent laboratory or field work in 
psychology. 

The treatment of fiducial limits and reliability of means, which $0 
often is either garbled or even erroneously presented in some elementary 
texts, is treated here accurately but in a rather verbose manner: It is 
gratifying to see that analysis of variance has been finally divorced from 
the barn-yard imagery which adheres to this term in agricultural an 
biological textbooks. 

The chapter on x? is primarily devoted to the treatment of qualita- 
tive or enumerative data and it would seem better to give it such a title. 
In general, this tendency to deal with the names of special techniques 46 
chapter headings rather than with the contents of these techniques !§ 
unfortunate, for it gives the student the feeling that statistics is made 
up of disparate tools. It would be much better to organize the content 
according to topic and let x? and comparison of percentages, for exam: 
ple, fall into the same group. Similarly, analysis of variance could well 
form part of a chapter or a series of chapters on comparison of means 
and thus give the student an integrated point of view regarding critical 
ratios, ¢ tests, F ratios, and correlation ratios. This is done to some ex 
tent on p. 199, but because of the segregation of the techniques into 
separate chapters it may appear to be an accident rather than a neces: 
sary result. By bringing these methods together, the consistency ° 
different treatments can be demonstrated and the unity of statistical 
method stressed. 

The final chapters on sampling prediction and design of experiment? 
fulfill a basic need in an elementary text. 

In general, the text’s chief advantage lies in the fact that it draws its 
material entirely from psychological sources and thus makes the stu 
dent of psychology feel quite at home within its pages. In some TO 
spects, it descends to quite an elementary level, presuming very little, 
perhaps too little; on the part of the student. In other respects, SP?” 
cially in the chapter on analysis of variance, it goes far beyon mi 
proper limits of an elementary text. 

In common with the majority of elementary texts, this book devote? 
little space to the consideration of even simple curvilinear relationship 
with the result that the student gets the impression that psychologic? 
data are primarily linearly related, and that functional or curviline® 
relationships do not exist, are due to artifacts or are beyond the pale ¢ 
psychological investigation. As a consequence, the student of psycho” 
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ogy is completely at a loss when he begins to deal with such nonlinear 
relationships as the Weber-Fechner Law, the many functional relation- 
ships in the field of visual and auditory perception, and the growth 
curves in developmental psychology. The treatment of functionally 
related variables is no more complicated than that of analysis of vari- 
ance or the correlation coefficient. Since it is so useful to the student of 
psychology it should find a place in the elementary texts. 
JOSEPH ZUBIN. 
Columbia University and 
New York Psychiatric Institute. 


GOTTSCHALK, L., KLUCKHOHN, C., & ANGELL, W. The use of personal 
documents in history, anthropology and sociology. Social Science Re- 
search Council Bull. 53. New York: Social Science Research Coun- 
cil, 1945. Pp. xiv+243. 


This Bulletin comprises three separate and distinct monographs. 
Gottschalk gives brief and concise descriptions of historiography and 
historical method in the use of documents. In similar fashion he deals 
With the principles of external and internal criticism of documents. 

hese principles are usually more important to historians than to 
Psychologists and social scientists, since the latter seldom have reason 
to question the authenticity of documents which come to them. Never- 

eless, social scientists frequently use historical method, and Gott- 
Schalk’s suggestions regarding the use of documents should help them 
ro avoid pitfalls in their excursions into historical fields. 

Kluckhohn points out that anthropologists, unlike their colleagues 
1n other fields, seldom work with materials which have been produced 
Spontaneously for confidential use. The documents employed by anthro- 
Pologists are usually written at the instigation of field workers, or are 
Merely records of interviews. Kluckhohn’s critical review of such docu- 
ments, including a number of biographies, is an excellent guide to the 
n erature, The suggestions which he makes for future research, and for 
saking available more of the original and detailed notes from fieid 
i ork, represent no mean contribution to methodology and technique 
n anthropology. ` 
d Angell catalogues and discusses the various ways in which personal 

Ocuments are used by sociologists, emphasizing that such materials 
Oth yield hypotheses and contribute to the verification of hypotheses. 
€ reviews the principal sociological studies of documents under two 
ea adings: (1) explanation of historical sequences, (2) contributions to 
ciological theory. A brief chapter on sociological method is followed 
24 one on suggestions for obtaining, analyzing and interpreting docu- 
ents and demonstrating the validity of hypotheses from documents. 
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No brief review can do justice to the importance of these monographs 
or to their usefulness as guides to users of documents. Nor can they be 
evaluated properly without considering them in the light of publica- 
tions of a similar nature by Dollard, Blumer, and G. W. Allport, the 
latter two of which are authors of previous SSRC Bulletins. The defi- 
nitions of personal document (or human document, as the case may be) 
given by these six writers are so different and distinct that they suggest 
the analogy of the blind men describing the elephant. 

Blumer defines the human document as “an account of individual ex- 
perience which reveals the individual’s actions as a human agent and as 
a participant in social life.” 

Allport defines the personal document as “any self-revealing record 
that intentionally or unintentionally yields information regarding the 
structure, dynamics, and functioning of the author’s mental life.” 

Angell, for his purposes, defines a personal document as "one which 
reveals a participant’s view of experiences in which he has been in- 
volved” (p. 177). 

Kluckhohn agrees with Dollard that the subject of a personality 
sketch or biography “must be viewed as a specimen in a cultural series, 
but adds that “the analysis will be incomplete unless the universal, 
communal role and the idiosyncratic components of personality are dis- 
tinguished” (p. 138). 

Gottschalk, in contrasting Blumer’s definition of human document 
with Allport’s definition of personal document, regards these adjectives 
as tautological, since ‘every document, no matter now thoroughly the 
author strove to be objective, must exhibit to a greater or less extent 
the author's philosophies and emphases, likes and dislikes, and henc® 
betrays the author’s inner personality” (p. 13). 

These definitions are quoted not so much to emphasize that the 
representatives of the several academic disciplines are not in complet? 
agreement regarding their fundamental concepts, as to point out tha 
in stating their definitions boldly and precisely these writers have Pt” 
vided the key to an understanding of the distinctive methods and tech 
riques characteristic of sociology, psychology, anthropology, and bis 
tory applied to documents. In these days when psychologists, soc}# 
scientists, and historians are being encouraged to pool their concep 
and techniques in grand “cooperative projects,” there is strong tempt” 
tion to urge that all of them should get together and agree on funda 
mental concepts and methods. While such cooperation is desirable, ! 
must be remembered that data, interpretations, and generalizations are 
in part functions of the concepts, methods, and techniques employed 1” 
the research which produced them. Pooling concepts, methods, a” 
techniques may result in data of doubtful value, interpretations devo! 
of clarity, and generalizations lacking precision and validity, unless the 
cooperators have scrupulous regard for the various methodologic’ 
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These monographs are excellent guides to such methodologies, at least 
in so far as the use of documents is concerned. 
ARTHUR F. JENNESS. 


Williams College. 


Litir, Raren S. General biology and philosophy of organism. Chicago: 
University of Chicago Press, 1945. Pp. 215. 


The question—What is life?—constitutes the central theme of Pro- 
fessor Lillie’s book. That he is well qualified to discuss this problem will 
be denied by few; that he has been successful in his attempt to formulate 
an answer will be denied by many. F 

It will come as no surprise to those who have read Lillie’s many 
contributions to the Journal of Philosophy, Philosophy of Science, and 
other scholarly journals, that he endorses a teleological interpretation 
of life, Such a vitalistic view is somewhat novel in a day when scientific 
Sophistication appears to be synonymous with an acceptance of physical 

etermimism. 

Following a brief review of cellular chemistry, Lillie attempts to 
show that the basic fact of chemistry is that the orderly and complex 
distribution of molecules requires energy. He feels that this energy 
Must be applied ‘‘—directively within the organism,” in order to over- 
come the natural tendency for molecules to become distributed into 
random and symmetrical patterns, a condition which is presumed to be 

ue to chance, This natural tendency is expressed in the second law 
of thermodynamics and is a tendency which must be overcome if there 
is to occur any cellular differentiation. The required energy, which he 
refers to as an anti-diffusion factor, is, according to Lillie, essentially 
teleological. That is, it is applied directively with some purpose. 
. For him, the organism is a psychophysical system in which the final 
Integration of the physical and psychical factors appears to be psychical 
rather than physical. The physical is characterized by constancy, by 
Permanence, and by a static nature, whereas the psychical is novelty, 
1S change, is process and occurs only in the present. The psychical can- 
Not be observed externally as can be the physical but is instead felt and 
Self-experienced. This is characteristic of one of the concepts of phe- 
Nomenology. He expressly disclaims a dualistic view but rather feels that 
the two fields, the physical and the psychical, are grounded in the same 
Undamental reality. This corresponds closely to the monistic double- 
spect view of psychology. a 

According to Lillie, the entire psychophysical organism acts as if it 
Were the field of a specifically unifying factor, a factor which has con- 
Stant properties but whose activity is directive and synthetic in essence. 

his action is carried out in pursuit of definite purpose and “psychic 
e becomes dominant and determines the special direction of what 
lappens in the physical world. For example, Lillie rejects the theory 
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that the natural selection of purely fortuitous variations (e.g. mutations) 
is a sufficient explanation of the origin of the entire range of adaptive 
characters. He definitely feels it necessary to explain evolution in terms 
of a directive purpose. 

The crucial point in Lillie’s theory, and one which he quite clearly 
recognizes as being of vital importance, is how the psychical can operate 
on and give direction to a physical system when the psychical factory 
as such, cannot exert physical force. He attempts to get around this 
dilemma by placing the locus of psychical control internal to or behind 
the elementary physical events (quantum transfers). How this solves 
the dilemma is not clear. It appears to be a restatement of the problem 
in.another way and.on another level. While some may feel satisfied that 
Lillie has found a locus for the psychical factor (the reviewer does not), 
the way by which it can exert physical force is still an enigma. 

How then are we to evaluate this book? Is it possible that the author 
has stated his views too soon, that he should have waited for more 
evidence? If Professor Lillie were at the outset of his career, we might 
reasonably expect him to remain an agnostic until more evidenc® 
should become available. But he is not at the beginning of his career: 
He is already an emeritus professor. Furthermore, his lifetime in the 
laboratory has given him much experience that could be obtained in n° 
other way. And is it not worthwhile for younger scientists to learn of his 
experience? 

It is probably impossible to correctly say whether Lillie is right 0" 
wrong in his assumption of a teleological interpretation of nature. True 
he does not have the weight of numbers in support of his position, but 
for that matter neither has any other scientist who has dared to break 
with tradition. Surely the history of science has taught us the folly ° 
mere majority opinion. It must also be taken into consideration t ha 
there is more hope in Lillie’s view than in that of the physical determi”? 
ist, but, unfortunately, wishing does not necessarily make a thing 50% 
Perhaps the chief merit of the work is in the possibility that it will sti?” 
ulate further research on the problem from which an unequivocal solt- 
tion may be derived. 


re Rosert P. FISCHER- 
University of Florida. 


FALES, WALTER. Wisdom and responsibility—an essay on the motivatio" 
of thought and action. Princeton: Princeton Univ. Press, 1946. Epi 
166. 


In this closely-written, highly epigrammatic, and occasionally ob- 
scure series of ten essays, an objective theory of value-experiences R 
presented. Like Koffka, the author holds that we are earlier aware ° 
the subjective values of objects than of their spatial and temporal aa 
tributes. These ‘‘subjective values” are motives; objective values ar 
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mental categories in terms of which motives are “innately” interpreted 
and judged. Intentions are prior to thoughts, decisions prior to insights. 
We are able to organize because we are ourselves previously organized 
and to create because we are creatures. “Everything which has weight 
in a man's life or meaning in his thinking derives its structure from his 
final ends” (p. 161). 

Individuality itself is nothing but the ability to regulate the growth 
of motives in such a way that situations become solvable (p. 26). Every 
individual develops in a chain of decisions which can be understood as 
evidencing a system of objective values; but our genuine decisions are _ 
determined by final ends which we do not even see although we have a 
relation to them. Personality or “model individuality” is the response to 
calls rather than to needs, i.e., we stand for something that is bigger 
than we are (p. 68). 

. _ Educationally, this outlook leads to the important suggestion that 
it is the function of intuitions to perceive wholes which account for the 
Coherence and meaning of that which otherwise would remain below the 
threshold of interest and attention. For example, knowledge obtained 

y way of inference is encyclopaedic and the property of all who are 


` intelligent enough to take possession of it; but more fundamentally, 


those impulses that lift us up to specific planes of understanding are 
Symptoms of an anxiety which drives us to master the problem ‘“‘world” 
in accordance with standards set by our final ends (p. 148). This defines 
the problem of the educator as the control of the momentum of powers 
active in shaping the better selves, i.e., the system of demands of other 
Persons—all because human life is essentially obligation. 

Superficially, this seems like a contemporary restatement of ethical 
transcendentalism, but on closer scrutiny it takes on more of the char- 
acter of a novel form of radical empiricism or field theory that seeks to 
Place the Jamesian marginal in experience on the same or even loftier 
Ontological footing than the focal regions of our phenomenal world. 

nly in such a context does the assertion that we are least selfish when 
We are most creative become intelligible. The key suggestion that mo- 
tivation is ultimately impersonal in the sense that the organism releys 
Orces greater than itself is truly profound, and may compel some useful 
Tevisions in the need schemata now popular among applied psycholo- 
8ists and “social engineers.” 
GEORGE W. HARTMANN. 
Teachers College, Columbia University. 


Sarcenr, W. E. Teach yourself psychology. Philadelphia: David McKay, 
1946. Pp. 159. 
Teach Yourself Psychology is the sort of popular presentation that 
mae hopes will not have a wide audience. The first chapter, consisting of 
ittle more than a listing of the names and dates important in the pre- 
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experimental history of psychology, furnishes some basis for believing 
that this hope will be justified. In the somewhat more readable discus- 
sions that follow, covering among others the topics of instincts, structure 
of the mind, and dreams, most of traditional experimental psychology is 
included in a single chapter. This reflects, of course, the author’s clinical 
(and religious) bias. He is still fighting the battle against Watsonian 
behaviourism, and he fights with the same religious zeal manifested by 
critics 20 years earlier. His own theory is taken in about equal parts 
from Freud and McDougall. The tone of the volume is typified by the 
following sentence: ‘‘Further, if man is nothing more than a machine 
which reacts to the world around him as a typewriter responds to the 
touch of the typist, how can we explain the visions of prophets and the 
dreams of seers whose thoughts are unique and in advance of their gen- 
eration.” The concluding sentence contains another example of the 
same sort: ‘Its (psychology’s) final aim is not merely to state how man 
thinks, feels, and acts, but how he can do these things much better and 
more in accordance with the Divine purpose that lies at the back of all 
things; . .. ” If the present volume is typical of the rest of the Teach 
Yourself series, the publishing effort involved would seem to be quite 
ill-advised. There is little question, however, that it is easier to make 
a mistake of this sort in psychology, than, for example, in algebra. 
Lioyp G. HUMPHREYS. 
University of Washington. 


BLUMENFELD, WALTER. Introducción a la psicologia experimental. 
Lima: Editorial Cultura Antarctica, S. A., 1946. Pp. 417. 


For more than a decade Dr. Blumenfeld has furthered the interests 
of psychology in Peru, and the influence of his prolific and constructive 
activity has extended over much of South America. His production 0 
this introduction to experimental psychology in Spanish is likely to be 
of great importance in establishing psychology as an autonomous field 
in regions where it exists largely in an incidental way in relation to other 
interests. 

o The book is a précis of experimental psychology, covering in 24 well- 
arranged chapters all the topics one would expect, supported by the in- 
clusion of very recent data. A full bibliography is drawn from a wide 
range of experimental literature and convenient tables, clearly printed 
charts, and excellent diagrams including some plans of laboratory ap- 
paratus, are extensively used. In style of language the exposition is 
consistently clear and economical. Publication was sponsored by the 
Instituto Psicopedagégico Nacional, which organization, along with 
the publishing concern, deserves considerable praise for a well-executed 
undertaking. In a straightforward preface the author tries to make 
clear to his readers the basic importance of experimental procedures for 
all psychological work, and thus provides an emphasis that has been 
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lacking in some parts of Latin America. In discussing such topics as 
psychoanalysis and personality he shows an enlightened scientific regard 
for the bearing they have upon experimental psychology. More than 
half the book is devoted to sensory and perceptual processes, and the 
psychophysical methods are well presented in their historical contexts 
as well as in present applications. It is interesting to observe through 
Dr. Blumenfeld’s portrayal the greatly varied utility of rathér a small 
a ber of scientific principles for building up the total experimental 
eld, 

Without introducing any dogmatism of theoretical approach the 
author quietly and effectively achieves his intention of allowing experi- 
mental psychology to speak for itself. The logical background is highly 
eclectic. And when it becomes necessary to make a break with the 
somewhat speculative philosophical tradition of Latin-American psy- 
chological thinking, the break is always made gently, considerately, and 
courteously. The strong presentation of a needed emphasis amidst an 
atmosphere devoid of polemics is a tribute to Dr. Blumenfeld’s ground- 
ing as a psychologist and skill as a writer. 

Howarp Davis SPOERL. 


American International College. 


Linx, Henry C., & Horr, Harry ARTHUR. People and books. New 
York: Book Manufacturer's Institute, 1946. Pp. 166. 


. Will the war-time boom in book-buying decrease with the peace- 
time availability of essential and luxury items? The true guide to the 
future of books is the reading habits of people, not just the present sales 
figures. In 1945 the book industry’s business reached a half billion 
dollar total and yet up until that time no one had scientifically investi- 
gated the market. 

Through the combined efforts of all the major groups of businesses 
which participate in the production of books, the Psychological Corpor- 
ation and the Hopf Institute of Management were retained to meet the 
need for a reliable and accurate index of consumer book reading and 
buying habits. An extensive survey was organized to answer the main 
questions about which the publishers and others of the book industry 
had heretofore been able only to conjecture and theorize. 

Four thousand interviews were conducted with consumers between 
May 21 and June 8, 1945, after a six month period of developing meth- 
ods and revising questionnaires. Acting as field supervisors, sixty-two 
psychologists directed the work of the interviewers in 106 cities and 
towns. 

_ The results of this research show clearly that income is not the dom- 
inating influence, but rather that formal education is the determining 
factor in book reading. To be sure, people have price preferences 1m 
books, but these are not based entirely on economy; often they reflect 
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motives which can be appealed to by the makers of both the lower- 
priced and the higher-priced books. 

Book stores, book clubs, and department stores, in that order, are 
the principal sources for the consumer purchase of books. Purchasers do 
not mean readers, however. In this study it was found that best-seller 
lists give no indication of the number of people actually reading the 
books. During the period preceding this survey the reading of Forever 
Amber equalled if not surpassed the reading of the Bible! 

The results show that the population spends twelve times as much 
time per day on newspapers, magazines, radio and movies as it does on 
books. The reviewers do not feel, however, as did the investigators, that 
this necessarily indicates so great an amount of available book reading 
time. We feel that before any such generalization as to potential book 
markets can be made, the purposes for which newspapers and magazines 
are read and the activities-while-listening of radio users must be ascer- 
tained. 

Forty-one per cent of the persons surveyed in this study said they 
own less than 100 books; thirty-four per cent claim to have more than 
100 books. The accuracy of respondents estimates was not empirically 
verified bya sample. ` 

A parallel survey was conducted with dealers, distributors, publish- 
ers, and educators. The results corroborated the consumer survey in 
predicting an expanding market for books. 

Two suggestions occur to the reviewers for possible improvement of 
the study: (1) an analysis of conditions under which the book industry 
should decide to contract, maintain, or expand future production facil- 
ities, so as to utilize more fully the present data, and (2) the develop- 
ment of a predictive formula for future commitments to be based upon 
the comparison of book sales with survey findings when, as the in- 
vestigators suggest, surveys of this type are repeated. 

Nancy C. CooLEY. 
ROBERT H. SEASHORE. 
Northwestern University. 


CLEETON, G. U., & Mason, C. W. Executive ability: its discovery and 
development. (2nd Ed. Rev.) Yellow Springs, Ohio: The Antioch 
Press, 1946. Pp. iii+540. 


According to the authors the purpose of this book is to report and 
coordinate the best available information concerning the qualities nec- 
essary for proper performance in executive positions. The topics treated 
range from neuromuscular activity to a definition of democracy. Em- 
phasis is placed on problems of selection and training and the role of 
executives in labor relations. The treatment appears to be strongest in 
the last area and less strong when the discussion turns to scientific priD- 
ciples of behavior. Typical of the latter case are the acceptance and us 
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of terms such as instinct, will, and mental capacity without a clear 
statement of operations defining them. 

_ The major shortcoming of this volume is that sweeping generaliza- 
tions, presumably based on research findings, are reported without the 
necessary information to evaluate the instruments, procedures, statisti- 
cal tools, and data. A few other criticisms may be listed. The presenta- 
tion of selection procedures (roughly 125 pages) omits the critical con-. 
cept of cross validation. Over-emphasis on rating scales leads to the 
ignoring of more objective measures of proficiency. The authors be- 
lieve that there is no communality among the traits necessary for suc- 
cess in different executive positions; this position implies a research pro- 
gram for each executive job, certainly an impractical task. s 

It is the opinion of the reviewer that the attempt to treat this im- 
portant applied field is admirable and that this book may be useful in 
presenting elementary notions for the non-psychologist working in this 
area. Itis felt, however, that the book falls short of its stated purpose of 
summarizing the best available data concerning executive ability. 

WILLIAM O. JENKINS. 
Indiana University. 


BLAIR, GLENN MYERS. Diagnostic and remedial teaching in secondary 
schools. New York: Macmillan, 1946. Pp. xv +422. 


Dr. Blair’s book has a two-fold purpose: “to supply teachers, prin- 
cipals, supervisors, ånd superintendents with concrete and practical sug- 
gestions for carrying out remedial programs” in secondary schools, and 
to serve “as a basic text in courses in diagnostic and remedial teaching 
aon in teacher-training institutions.” Accordingly, the book is divided 
into three parts. Part I, consisting of the first seven chapters, is devoted 
to “Diagnostic and Remedial Teaching of Reading.” Part 2, consisting 
of chapters 8 through 11, deals with remedial work in secondary schools, 
in arithmetic, spelling, handwriting, and the fundamentals of English 
respectively. Two chapters dealing with the making of case studies and 
P eppranoa for remedial teaching, comprising Part 3, complete the 

ook. ` 

Dr. Blair is not concerned with the theoretical implications and 
ramifications of remedial teaching on the secondary level, including even 
college, as he suggests. He does not treat remedial teaching on the 
secondary level as a necessary evil, the need for which ought not to exist 
and which ought to be prevented. Reading, Dr. Blair states ‘is such a 
complex skill that it is possible for the elementary school merely to 
initiate the process and to develop a few of the basic skills” (p. 3). Ac- 
cordingly, he devotes his first chapter—the introduction—to statements 
as to the importance of reading, the extent of reading disability on the 
secondary level, the effects of reading retardation, and the meaning © 
remedial teaching. Thereafter, he sets forth the procedures, techniques 
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and plans involved in remedial teaching, in the faith—while admitting 
the desirability of thorough preparation—that any competent teacher 
can make a contribution to this area. In the words of Dr. Blair, ‘For 
after all, remedial teaching is just good teaching” (pp. 405-406). 

The subjects dealt with range from the means of locating the de- 
ficient student and the causes of deficiency and their discovery, to the 
materials and exercises for the improvement of these deficiencies and 
the manner of employment of such materials and exercises. 

A nation-wide survey of remedial teaching in secondary schools, 
made by the author in the spring of 1940 and discussed in chapter six 
of the book, formed one of the chief sources for the book’s material. 

Twenty illustrations and 22 tables, thorough documentation for 
practically every important fact and/or statement made, and additional 
extensive references at the end of each chapter, except the last, enhance 
the value of the volume. 

Jacos I. HARTSTEIN. 

Long Island University 

and Yeshiva University. 


Deutscu, A. The mentally ill in America: a history of their care and 
treatment from colonial times. New York: Columbia University Press, 
1946. Pp. xvii+530. 


This book was originally published in 1937 by Doubleday, Doran 
and Company. After being out of print for some time, it has been re- 
issued by Columbia University Press. The book is well documented with 
eighteen pages of bibliography, arranged by chapters. There is also 4 
well-prepared sixteen-page index. 

Tracing the history of the care and treatment of mental illness from 
earliest historical times to the founding of the American colonies, the 
author presents the colonial period as one of the most shocking periods 
of our social history as reflected in attitudes toward the mentally ill. 
From this period, the author traces the historical evolution of changing 
attitudes and concepts up to the present-day mental hygiene movement, 
including the story of Clifford Beers, its founder. 

The author describes the end of the American Revolution as the real 
beginnings of American psychiatry under Dr. Benjamin Rush, “the 
Father of American Psychiatry.” Following him, a period of regression 
ensued until the middle of the nineteenth century when Dorothea Lynde 
Dix brought the attention of a shocked nation to the care and treatment 
of the mentally ill and catalyzed it into constructive action. The next 
outstanding period was the turn of the twentieth century which 
heralded “the coming of age of psychiatry in America.” At this time 
“asylums” were beginning to be called “hospitals,” with the gradual as- 
cendancy of the therapeutic over the custodial ideal, the growth © 
private psychiatric practice, and the rise of out-patient clinics. 
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__ Integrated with this history are chapters on the rise of State respon- 
sibility for the mentally ill, glimpses of the outstanding psychiatrists 
who have contributed to the field, the epic battle between proponents of 
restraint and non-restraint, the care and treatment of mental deficiency, 
the criminally insane, laws governing the commitment of the mentally 
ill, and a final chapter on mental hygiene. This latter chapter provides 
a dynamic prospectus for the mental hygiene movement and is well 
worth quoting for its intrinsic value: 


A world of peace and freedom, from which the twin specters of war and in- 
security will be banished, a world of equal opportunity, where people will be 
freed from stunting inhibitions and ‘guilt feelings’ arising from outworn preju- 
dices and taboos, a world where children may lead healthy, happy lives and 
grow into useful, well-adjusted citizens, where the personality is permitted to 
develop naturally and freely, where the individual is given a sense of personal 
worth and dignity, and where his activities and ambitions are integrated with 
the development of group life—such is the goal toward which mental hygiene 
must strive. 


Psychologists may well take exception to one serious omission in an 
otherwise well-organized book. Although he recognizes the role that 
social work has played in this field, practically no recognition is given 
to psychology’s contributions, except for the section on mental defect 
wherein the author discusses intelligence test development. The author 
fails to include the part played by psychologists in assisting psychia- 
trists in diagnosis and treatment through programs of psychological 
diagnostic testing, vocational and educational counselling, various types 
of therapy, clinical research and other related activities. 

However, the book is an important contribution to the history of 
mental disease in this country. It charts the path that must be followed 
if mental hygiene, as well as psychiatry in its true meaning of mental 
treatment, are to go forward to greater heights of accomplishment 
rather than to become static or to regress. 

Jures D. HOLZBERG 

Connecticut State Hospital 


Duntap, KNIGHT. Personal adjustment. New York: McGraw-Hill, 
1946. Pp. xii+446. 


Though apparently meant for the undergraduate and the layman, 
this book appears to promise little enlightenment on the psychody- 
namics of maladjusted behavior. The treatment is dogmatic rather 
than scholarly, as evidenced by the author's choice of expressions, his 
omission of a bibliography, and his neglect to cite the evidence for 
atypical generalizations. For example, the*reviewer™ would have ap- 
Preciated evidence for the statement that ‘‘most neurotics are vegetar- 


ians and have been vegetarians for the greater part of their lives.” 
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Had the author merely failed to make a contribution to knowledge 
or understanding, the book would hardly deserve mention. However, 
the book appears to have had a mission, as stated in the preface, where 
Dunlap writes that “it is high time that psychoanalysis should be pre- 
sented in its true light.” This purpose does not seem to have been ful- 
filled, since the author has evidently succeeded only in presenting psy- 
choanalysis from a strongly biased point of view. Few contextual op- 
portunities were missed to malign Freud and psychoanalytic concepts. 
To the reviewer, it seems that it is no longer fashionable among psy- 
chologists to regard psychoanalytic concepts as if they were merely the 
products of ‘‘mythology” and ‘‘superstition.” 

Dunlap’s first attempt in the present book to present psychoanalysis 
“in its true light” states: “The progress that was expected a generation 
ago to be a resultant of co-operation of psychologists and physicians 
was thwarted by the rise of psychoanalysis, which is based on ancient 
popular superstitions.” It occurs to the reviewer that the prejudice of 
American psychologists a generation ago, and their lack at that time 
of techniques by which to deal effectually with personality dynamics, 
should share in the responsibility for this failure in co-operation. Prob- 
ably the public expression by psychologists of attitudes such as those 
expressed by the author will prolong the difficulties in co-operation 
which the author deplores. 

The reviewer fails to see wherein the total effect of this book can be 
of service to the psychological profession or to the teaching of psychol- 
ogy. At many points Dunlap states atypical opinions as if they were 
the generally accepted doctrines of psychologists. The author might 
have profited by the admonition of modern semanticists that one’s own 
opinions should be so identified rather than being attributed to every- 
body in possession of the relevant evidence. 

In fairness to the author, it should be mentioned that his chapters 
on sex and marriage are vividly frank in the treatment of a subject 
which often is veiled or avoided in college texts of this sort. Had the 
remainder of the book maintained the standard set by these chapters, 
the reviewer might have foreseen for it a more favorable reception than 
it deserves in its present form. 

Bert R. SAPPENFIELD 

Montana State University 


Baxter, E. D. An approach to guidance. New York: Appleton-Cen- 
tury, 1946. Pp. xii +305. 

This book consists of two parts. Part I is a fictional presentation of 
the experiences of a director of guidance in a small public school. Her 
activities, problems, and viewpoints are illustrated in specific incidents 
and conversations with the school administration, teachers, parents: 
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and others. Part II is entitled The Story Interpretation and attempts to 
State in concise terms the principles and methods of guidance and edu- 
cation illustrated in the story. There are 138 such interpretations, each 
clearly related to the story proper by marginal references. Throughout 
both the story and the story interpretation supporting references are 
made to recent literature and the volume ends with a 227-item anno- 
tated bibliography. 

With respect to method of presentation, the book is admirably done. 
The story is interesting and realistic, the interpretations are to the point, 
and the bibliography is skillfully annotated—a rare achievement in it- 
self. With respect to content, however, the book can be evaluated fairly 
Only if the orientation and purposes of the author are clearly recognized. 
In the opinion of the reviewer, the author’s attitude toward guidance is 
best expressed in the following quotations: “every teacher is a coun- 
selor” (p. 199) and “adjusting, happy teachers’ mean adjusting, happy 
Pupils” (p. 3). The “guidance expert” will thus look in vain for a de- 
tailed treatment of the use of cumulative records, aptitude tests, home- 
Toom programs, administrative charts, etc., such as presented by Trax- 
ler, Darley, Williamson, Strang, Reed, and others. Instead he will find 
emphasis upon inter-personal relationships among pupils, teachers, par- 
ents, and administration and upon the place of the guidance director in 
facilitating these relationships. ‘Guidance’ becomes almost synony- 
mous with “education” and the approach to better education is through 
the teacher's own adjustment and personality growth and the cultiva- 
tion of a greater awareness of individual needs. ( 

An Approach to Guidance will therefore find its place in the litera- 
ture on guidance: (1) asa broadening experience to guidance specialists 
Overly engrossed in the mechanical details of administering tests and 
Programs; (2) as a revelation to school administrators of the need for 
teacher, as well as pupil, guidance; (3) as an inspiration to teachers-in- 

' training; (4) as a general commentary on the goals of education in a 
emocratic society; and (5) as a guide to guidance directors under- 
taking a new assignment. It is not a text book in guidance methods, 
however, and will serve primarily as supplementary reading in courses 
pe aden personnel work, public school guidance and student coun- 
seling, 
ALBERT S. THOMPSON 
Vanderbilt University 


Brack, Irma S. Of to a good stari. New York: Harcourt, Brace, 1946. 
Pp. xii +256. 
In an informal, often humorous way this “handbook for modern 
Parents” discusses the common problems and perplexities that are an 
integral part of parenthood. Parents will appreciate not only the sen- 
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sible and non-technical advice directly given but also its underlying 
assumption that parents are on the whole a well-meaning lot of people 
doing a difficult job well. It urges parents to accept and respect their 
own personalities as well as those of their children, and emphasizes the 
reassuring fact that there are thousands of ways of being successful 
either for a parent or a child. 

The usual range of fundamental problems is covered and many of 
the aggravating, if often minor, ones also included. The discussion © 
the essentials of feeding and habit training is rich in practical sugges- 
tions as well as basic in modern theory of the psychological significance 
of these learnings. Discussion of the child developing as a member of the 
family group includes good material on discipline, sex instruction, the 
special problem of the precocious child, as well as the minor worries © 
birthday parties, Santa Claus and the subtle torments of a week of en- 
forced indoor play. Parts III and IV take the child out into the larger 
world of relations to other children, to adults from Grandpa to dentist, 
and the widening horizons of travel and schooling, including an analysis 
of the progressive school’s status. The concluding section on intellectua 
growth and self-expression gives many helpful suggestions concerning 
play materials and the problems of readiness for the three R's, together 
with an evaluation of the roles of radio, movies and circuses in the child’s 
life. 

MIRIAM FORSTER FIEDLER 


Vassar College 
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I. INTRODUCTION * 


. Convulsive behavior, whether in the human or the infrahuman 
subject, invariably excites the interest of the observer because of its 
dramatic deviation from typical response patterns. To contribute 
Significantly to our scientific understanding of behavior, however, the 
observations of seizures under a variety of conditions must be related 
Systematically to one another, with an attempt made to delineate a 
limited number of causal sequences common to the several situations. 
So far as I have been able to ascertain, this has not been done for the 
laboratory rat. This review is designed to bring together some of the 
salient features of experiments utilizing this animal, with the hope that 
the juxtaposition of certain findings will stimulate integrative thought 
in the form of theories of behavior mechanisms." 

The independent variable which the majority of investigators have 
employed to induce seizures in the rat is intense sound. The earlier 
Studies of this type have already been summarized (40). But other 
experimenters have reported that convulsions can also be produced by _ 
such factors as conflict, electrical stimulation, injection with certain 
drugs, and deprivation of any of several dietary elements.? At first 
glance these major variables may appear so diverse that their logical 
Organization under a single heading is impossible. But further con-., 
Sideration suggests that the interrelationships among the numerous 
Parameters are so complex that it would be almost as artificial toves- 
tablish five mutually exclusive categories of seizure. The convulsions 
Observed in the several situations may or may not reduce ultimately to 


* Convulsive behavior has also been studied experimentally in such mammals as the. 
cat (e.g., 43, 116, 138, 140, 173), the dog (18, 23, 70, 147), the mouse (60, 67, 117), the 


rabbit (139), and the pig (18) 

? There is no ae % limit the list of experimental convulsants to these few. Cobb 
(20) summarizes 56 clinical causes ‘and 13 physiologic mechanisms that may underlie 
convulsive behavior. (Cf. also 172.) 
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a common physiological denominator. But even if there still remain 
two or more “kinds” of convulsion, an attempted unification at the 
present stage may well lead to the development of new methods of 
studying the nature of one or another of these varicties. 

` The question may legitimately be raised as to the psychological 
importance of investigations of convulsions which adopt the rat as 
subject. Enough has been written in defense of the comparative method 
that little need be added here. It should be stressed, however, that n° 
transfer of principles from rat to human behavior can safely be made 
until a clearer insight into the complex problem of the rat behavior 
itself is achieved. For the present, then, this field remains of direct 
interest primarily io the animal experimenter. 

There are three principal reasons why the investigation of induced 

convulsions is of value. 


1. The appearance of the seizure may be influenced by a number of factors— 
drug injections, dietary deficiencies, the physiological concomitants of aging 
chronic emotionality, and conflict, for example. Asa consequence, this pattern 
of behavior might be employed as an assay indicator of the presence an 
strength of such factors. With few exceptions (e.g., 8), this possibility has bee” 
little exploited, probably because of insu ficient standardization of the indicato" 
itself. Experiments with this orientation are presumably to be based on suc? 
relationships as are discussed in Section IV of this review. 

2. The convulsion may alter other behavioral and physiological character” 
istics of the organism. Several investigators have reported the modification e 
learned patterns following seizure, and others have attempted to demonstrat 
changes in diurnal activity, physiological condition, and general reactivity’ 
These experiments have for the most part failed to yield unequivocal results, e 
continued work in this direction will do much to clarify the neurological bee 
of these behavior segments and contribute to our fundamental understanding 0 
such matters as the learning process. Section V summarizes the few studlé 
bearing on this type of problem. 

3. The nature of the convulsion is of considerable interest for its own sake- 
Its outward resemblance to human epileptic and induced convulsions has fre 
quently been pointed out, and the similarity of appearance may be parallele by 
a similarity of mechanism. If it should eventuate that nothing can be con 
tributed to the understanding of these human syndromes, at least there is every 
reason to €xpect that the controlled observation of the rat seizure will expan 
our knowledge of basic neurophysiological processes. Theories attempting `- 
describe the physiological substrate are a step in this direction; they are CO” 
sidered in Section VI. 


II. THE CLASSIFICATION Or CONVULSIONS 


From the historical point of view, one might be justified in setting 
up five categories of convulsions as observed in the rat, each “kin 
corresponding to a relatively unique set of experimental conditions. 


|. 
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That this is probably too simple a treatment has already been noted, 
will be elaborated upon in Section IV, and will be the premise upon 
which the logical organization of Sections IV and VI will depend. In 
the present section the oversimplified five-fold classification will be 
outlined, each classification based upon one major causal factor or kind 
of factor (“invariable antecedent") that has been proposed by some 
Investigators as accounting for certain convulsions. 


Conflict-induced Convulsion (‘Experimental Neurosis”) 


_ The experiments which gave the initial impetus to the psycholo- 
Sist's interest in rat seizure behavior were designed to demonstrate the 
efficacy of conflict in the production of behavior aberrations. In 1938 
Maier and Glaser (98) reported that rats forced by an air blast to re- 
Spond to an insoluble problem in a Lashley jumping apparatus some- 
times exhibited behavior termed by them “experimental neurosis.” The 
rat might be confronted, for example, by just one card, responses to 
which had in the past been punished. The resultant neurotic behavior 
was described (92) as including circular running at terrific speed, during 
Which collision with objects in the room was frequent; tonic and clonic 
Spasms of variable extent and intensity; and a peculiar hopping loco- 
motion, Following this active phase a passive phase occurred, during 
which the apparently unconscious rat would maintain unusual pos- 
tures imposed upon it and was extremely insensitive to stimulation. 
Recovery was gradual, and within a few minutes the rat appeared 
quite normal, although his cage behavior was described as typically 
“retiring” (92, p. 30). 

In later researches Maier and his collaborators found that the 
“neurotic” reactions were inhibited by preliminary stimulation with 
Sound, lowering of body temperature, excessive muscular activity 
(94, 101), the development of “abortive” jumping (93) or position 
habits (103), and the previous occurrence of attacks (99). On the other, 

and, subconvulsive doses of metrazol (106), increased intensity of the 
auditory stimulus (99), and confinement in a small, opaque stimulation 
chamber (99, 101) tended to increase the convulsive response to the 

‘conflict” engendered by the noisy air blast. Hereditary influences 
Were also demonstrated (95, 100). 

The first doubt that this abnormal pattern was dependent upon 
Conflict was raised by the report by Morgan and Morgan (122) that 
Tats lacking discrimination training and stimulated in their home cage 
by the high-pitched tones of an air blast sometimes exhibited seizure 

ehavior outwardly identical to that of Maier’s rats. Indeed, as was 
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later pointed out (118), every instance of the “neurotic” seizure re- 
ported by Maier occurred in the presence of loud noise, and some were 
observed completely outside the discrimination situation, when the 
only apparently invariable stimulus was intense sound. Other stimuli, 
electrical and mechanical, were adopted by Maier to force the rats to 
jump in the difficult problem situation, but none led to the convulsive 
pattern. The suggestion was inevitable that it was sound, rather than 
discrimination-conflict, that precipitated the convulsion. Other inves- 
tigators (4, 38, 73, 86, 109, 126, 130, 148, 151, 154, 155) have corrobo- 
rated the observation that untrained animals can often be made to 
convulse by a few seconds! of auditory stimulation. In fact much of 
Maier’s later work has been concerned with the production of seizures 
by various sound sources, with no attempt made to induce conflict by 
discriminative or other training (cf., e.g., 94, 95, 99, 100, 101, 102). 
Because sound seems to be the most invariable antecedent of the con- 
vulsive seizure, because no conflict between formally learned tendencies 
is required, and because the changes originally described by Pavlov 
(137) as characterizing experimental neurosis (temporary loss of dis- 
crimination ability, resistance to the experimenter, and alterations in 
general modes.of behavior outside the experimental situation) have not 
been demonstrated, the majority of investigators now prefer to regard 
the behavior as something other than conflict-induced experimental 
neurosis (e.g., 10, 41, 83, 121, 153). 

Maier's original position is, however, not undefended. Maier him- 
self states that although “. . . auditory stimuli are basic in the pro- 
duction of abnormal behavior” (99, p. 29), “The experimental neurosis 

. .. is not a response to air alone” (92, p. 67). The situation involving 
simple auditory stimulation can be regarded as conflictful, “since the 
auditory stimulus . . . [gives] rise to no specific response but [is] ex- 
canta in nature and the enclosed space restrict[s] general behavior 
fa serve[s] as the inhibitory aspect of the situation” (93, p. 369): 

itterman (12, 13) has argued in the same vein, holding that in the 
stimulation situation there is conflict between the excitatory auditory 
stimulation and the inhibitory physical barriers (including empty 


* Smith (153) found that certain peculiar “‘premonitory” behavior was followed bY 
the full seizure in more than 80% of his 1309 rats. This frequent Coincidence would make 
plausible the inclusion of the Premonitory behavior as the first Part of the Isivé 
pattern. Using this as the indicator of onset, many attacks were observed to begin wit 


one second after the beginning of the auditory stimulation. The mo 


sive components were observed to h i - i ; 
Bet tved to have latencies of 1-7 sec, in 50% of his cases (cf. 13 
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Spaces). More direct support of the conflict (vs. auditory) hypothesis 
is found in the results of two experiments in which sound was apparently 
not a factor. Griffiths (56) describes in considerable detail convulsions 
Occurring in a rat forced by electric shock to respond in a difficult 
choice situation, and Bitterman (14) reports one seizure in a learning 
Situation in which shock was the punishing agent. If additional ex- 
Perimental evidence of this nature is forthcoming, and if it cannot be 
attributed to such an alternative factor as dietary deficiency or elec- 
trocortical shock, the existence of the “conflict-induced convulsion” 
category will become more logically acceptable. 


Sound-induced Convulsion (‘‘Audiogenic Seizure”) 


Although the physiological explanation is still obscure, there is no 
question but that exposure to intense auditory stimulation will cause a 
large number of rats to exhibit a pattern of response deviating markedly 
from typical laboratory behavior (4, 38, 73, 86, 99, 109, 126, 130, 148, 
151, 154, 155).4 Sound sources successfully used include an air jet, the 
Galton whistle, buzzers and bells, jingling keys, and an oscillator. No 
Previous training is necessary, nor has any extrinsic factor but sound 
been demonstrated to be essential. With a latency varying from less 
than five seconds after the onset of stimulation to somewhat more than 
one minute, the previously “normal” rat begins a reaction closely 
reminiscent of the pattern described by Maier as experimental neurosis. 

he pattern is far from stereotyped, various combinations of wild 
tunning, tonic-clonic convulsions, and stupor being possible. 

This reaction was probably first mentioned in 1924 by Donaldson 
(28, P. 134), who, in describing the acute hearing of the white rat, noted 
the “almost maniacal running and jumping” that often occurred when 
a bundle of keys was jingled before the home cage. Investigators of the 
effect of magnesium deficiency in the rat described as early as 1932 a 
©envulsive sequence that occurred “spontaneously” or upon auditory _ 
Stimulation (51, 85), and Dice (27) reported a similar pattern in “epi- 
€ptic” mice exposed to the sound of jingling keys. It was not until 
1939, however, that this atypical response came to be widely known to 
Comparative psychologists, largely as the result of the controversy over 

© nature of the Maier-type “experimental neurosis.” Because of the 
®Pparent primacy of the auditory component in inducing the convulsive 
reaction, Morgan and Waldman (123) suggested the descriptive phrase 


th * Sound has also been observed at times to induce various “abnormal” patterns in 
ae human (e.g., “musicogenic epilepsy,” 23), the dog (23, 124), the cat (5), the guinea 
E (171), and the mouse (27, 117, 168). 
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“audiogenic seizure,” which has become the most widely accepted i 
term. 

The audiogenic seizure is influenced in both its pattern and frequen- 
cy of occurrence by a number of secondary factors, so that susceptibility 
figures within a colony and in a particular testing situation range from 
10% or less (86, 122) to well over 90% (34). The inhibiting effect of the 
following factors has been demonstrated: strenuous activity before or 
during the stimulation period (2, 76, 101); extreme restriction of move- 
ment (54, 88); the establishment of position habits (103) or the develop- 
ment of abortive jumping (93) when the stimulation is in the Lashley 
apparatus; the appearance of a fear reaction (2); previous exposure to 
the convulsion-producing sound (11, 38, 76, 99, 153); electrification of ~< 
the floor during the auditory stimulation (15); the provision of a “‘psy- 
chological shelter” (56); taming (76); temperature changes (101); 
chronic “emotionality” (11, 114); vagotomy (88); partial curarization 
(88); and the administration of such substances as adrenin (2, 87) 
dilantin sodium (21, 54, 148), adrenal cortex (54), thiamin hydrochlo- 
ride (59, 81), atropine (72, 73), B-complex vitamins and minerals (131, 

133, 134). The frequency or severity of scizure has been found to be 
increased by such means as repeated anger stimulation (163, 164); f 
increased intensity of auditory stimulation (48, 120, 121); chronic 
disturbances induced in other situations (75, 76); increased conflict, 
e.g., as provided by additional barriers to escape (99, 103); cutaneous 
stimulation furnished by water spray (97); temporary deprivation ° 
food and drink (38); lactation (127); perhaps prenatal treatment with 
sodium bromide (64); B-complex avitaminosis (133); magnesium de- 
ficiency (135); and injection of subconvulsive doses of strychnine 
153, 154), metrazol (78, 103, 106, 145, 154), caffeine sodium benz0at? 
(154), sodium phosphate (50), and coramine (166). Age difference? 
(31, 33, 39, 102), sex differences (33), genetic background (34, 39, 57, 

58, 95, 100, 113), blood composition (86), inanition (132), corticê 
insult (7, 153, 169), punishment for seizure (107), and eserine (72, ) 
and phenobarbital (77) injections have also been studied with respect 

to their effect upon audiogenic seizures. 

The nature and significance of the sound-induced seizure have bee? 
considered variously. The majority of investigators seem to be seckin 
an explanatory principle involving a rather direct overloading of tE 
nervous system by the auditory stimulation. A few (12, 93) still hol 


(153), “epileptoid convulsive reaction” (4), “noise-induced seizure” (72) “noise-fi8 
convulsion” (155), and ‘‘sonogenic convulsion” (117) ' 


] 
S 

s0 : ei a aaora gerzure 
ther terms applied to the behavior in question include “audio-epileptic seizury 
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the seizure to be a manifestation of strong conflict between opposed 
tendencies, and others (2, 56, 160, 164) emphasize the intense emotional 
components of the precipitating situation. It may be, as Patton has 
intimated (129), that the so-called audiogenic seizure is simply a mild - 
form of the disorders which clearly result from severe nutritional defi- 
cits. This would appear to be an experimentally fruitful suggestion, the 
Verification of which will probably wait on more refined neurophysio- 
logical and biochemical techniques. In any case, the apparently inti- 
Mate connection with convulsions otherwise produced makes it rela- 
tively meaningless at this juncture to consider the audiogenic seizure as 
an isolated phenomenon. 


Electrically-induced Convulsion 


Another method of producing convulsive behavior in the rat is the 
Passage of electric current through the skull (Ferrier, 1873, 172). The 
technique is analogous to human electroshock therapy, and the results 
are in part similar. The electrodes are variously applied: to the shaven 
skull (80), one to the head and the other to the tail (125), to the ears 
(29, 149, 155), to either side of the head between the eyes and the ears 
(48), or on a line one centimeter behind the eyes (162). The shocks 
€mployed to produce the generalized convulsion have included 10-20 
volts, 30-50 milliamperes, 0.3 seconds (66); 40 v., 20-25 ma., 2 sec. 
(80); 70-85 v., 0.2-0.6 sec. (29, 149, 155, 162); 120-145 v., 50-100 ma., 
0.1 sec, (174); 600 v., 4.5 ma., 2 sec. (125); and 700 v., 9-14 ma., 0.75- 

95 sec. (48). At these intensities a convulsion consisting of tonic 
flexion, often succeeded by extension and clonic contractions, occurs in 
most subjects, usually followed in 10-30 sec. by a period of hyposensi- 
tivity and reflex depression. This reaction is attributed by Kessler and 

ellhorn (79) to a hyperglycemic condition, in which the autonomic 
guance is on the sympathetico-adrenal side. Wortis et al. (174) suggest 
B the mechanism involves inhibition of the oxygen uptake of the_ 

rain (cf, 25). 
= Heilbrunn (66) has found that light ether anesthetization attenuates 

abolishes completely the electroshock convulsion, and according to 
age (125) alcohol tends to raise the shock threshold. On the other 
a atropine sulphate, synthetic vitamin K, calcium gluconate, 
Tomboplastic suspension of brain substance, and combinations of 
ue (66), and adrenalin (125) have been found ineffective in dimin- 

ung the effect of the shock. j 
ae hen a shock of decreased intensity or duration is substituted, the 

Neralized convulsive pattern is sometimes replaced by one resembling 
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the audiogenic seizure more closely, in that the tonic-clonic phase is 
preceded by a period of running. This running has been described by 
Page (125) as being more controlled and “‘less forced” than that typical 
of the audiogenic pattern. However, Golub and Morgan (48), Stain- 
brook (155), and Siegel and Lacey (150) seem to have succeeded in 
reproducing more exactly the running picture characteristic of the 
audiogenic convulsion. This production of the pattern attributed by 
some to conflict or to sound is presented as strong evidence that neither 
of these factors is as significant in the etiology of the running-convulsive 
behavior as sometimes implied. Such reasoning is not indisputable but 
at least suggests that a rapprochement of the differing viewpoints and 
experimental findings, perhaps at the neurophysiological level, may be 
mutually enlightening. 


Drug-induced Convulsion 


As early as 1864 convulsions were produced in experimental animals 
by the injection of foreign substances (Morcé, 172). With the wide- 
spread adoption of drug therapies, it was to be expected that a large 
number of investigators should turn to the rat in an effort to ascertain 
the nature and therapeutic mechanism of the convulsions. Metraz°o 
has been the most frequent convulsant studied (e.g., 68, 80, 89, 142). 
In one investigation (105), a 1% solution of metrazol in 0.85% saline 
was injected intraperitoneally, with an average convulsive thresho 
found in one group to be about 40 mg. per kilogram body weight, on 
the latency of attack approximately three minutes. Tonic flexio™ 
usually with torsion of the spine, is the most characteristic feature oft He 
convulsion, usually followed by a clonic stage. A passive phase term? 
nates the seizure, sometimes preceded by a brief period of rigidity $ 
stiff hopping. Recovery appears complete within a few minutes. The 
tolerance to the drug has been found to change with repeated admins 
trations, increasing when treatment is given on alternate days) ; j 
creasing when injection is weekly (143). The mechanism underlyi”® 
thewezure has been suggested to involve increased cerebral metabolis” 
and anoxia (25, 174), and increased autonomic (especially sympath? 
tico-adrenal) excitation (37). The discovery that rats maximally 
susceptible to sound-induced convulsions have on the average al v4 
cae pagam an 5 eom convulsion of longer latency and =y a A 4 
underlines the necessity n ra me a — a in t es? 
different experi of interrelating the patterns appearing ! 

perimental situations. 


; jes 
Insulin has also been employed as a convulsant. Berman and pie? 
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(9, 141) injected one unit per 12 grams body weight, which led in 
approximately 50% of instances to severe convulsion. Hypoglycemia 
1s very probably a causal factor (8), and inhibition of brain oxygen 
uptake is also probable (174). 

There is no reason to limit the list of convulsants administered to 
the rat to these two, as any list of pharmacological properties will 
testify (69). The fact that subconvulsive doses of many chemicals 
modify the convulsive response to sound or conflict enhances the likeli- 
hood that common physiological bases for the different “kinds” of 
convulsion may be discovered through this type of investigation. - 


Convulsion Associated with Dietary Deficiency 


Rats deprived of one or more elements of the vitamin B complex 
(17, 131, 132, 134), normal dietary magnesium (26, 124, 135), or cal- 
cium and vitamin D (50) become abnormally sensitive to stimulation. 
If the deprivation is begun early enough these rats will respond to mild 
auditory or cutaneous stimulation with behavior described as “‘epilep- 
tic” (19) or “uncontrolled” (84)—running fits accompanied or followed 

Y convulsions (24). Concomitant physiological changes are often 
apparent; Kruse eż al. (85) found that hyperemia and vasodilatation ` 
egan about four days after introduction’ of a magnesium-low diet and 
Subsided 12 days later, with the onset of seizures, death usually inter- 
vening within another four weeks. As the deficiency increases, con- 
vulsions are often observed to occur spontaneously, i.e., without obvious 

relation to particular environmental stimuli (52, 135, 136). 

The behavioral similarity of the attacks occurring under conditions 
of deprivation to the ‘“conflict-induced” convulsion and the ‘audio- 
enic” seizure might inspire a search for a common physiological etiolo- 
8y. This approach is still more strongly suggested by the sequence of 
vents as deprivation becomes progressively more severe. For example, 
With a diet including 1-2 mg. of magnesium per 100 g. of dry food, 
Convulsive seizures occur only upon application of a stimulus such as an 
air blast (51, 52). With only 0.4 mg. of magnesium per 100 g. of dry 
ood, convulsions occur in response to less and less intense sounds as the 
Period of deprivation continues (53, 85), and finally “spontaneous” 
Seizures may be observed (135). Pyridoxine deficiency runs a similar 
Course (136). (It is pertinent to note that sounds are by far the most 
effective stimuli for releasing these convulsive seizures.) One tentative 
explanation of the difference between various strains of rats in suscepti- 

ility to sound has been a difference of diet, which becomes an obvious 
Possibility in light of the above studies. Patton (129, 136) by implication 
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goes further. With a mildly deficient diet he obtains approximately 
100% convulsive responses to sound, and with more severe deficiency 
“spontaneous” seizures in nearly all rats; with a diet fortified by mine- 
ral and vitamin supplements the frequency of “audiogenic seizure’ is 
close to zero. It is no great extension of this relation to propose that 
underlying all sound-induced convulsions is a dietary deficiency, ar 
that the Maier-type “neurosis” is no more conflict-induced than were 
the symptoms attributed by Kruse et al. (85) in 1932 to magnesium 
deficiency. Until these possibilities are eliminated, it will be well to 
consider the general topic of “convulsions” as a whole, rather than by 
categories, evaluating each of the many factors that may modify the 
behavior (cf. Section IV). 


Il]. THE CONVULSIVE PATTERNS 


Before a decision can be reached as to the unity or diversity of the 
“types” of rat convulsions, it is obviously necessary to compare the 
patterns appearing in the several situations. It must be confessed that 
no very confident comparison can be made at present since because O 
the inherent variability of the patterns, or deviations reflecting testing 
differences, or inaccuracies of the observations, there is imperfect agree 
ment as to the precise form of behavior that characterizes each “type: 
Whether or not differences between the categories are reliably greater 
than differences within a single category, outweighing the similarities: 
probably will be settled only on the basis of refined photograph? 
techniques. 


; Convulsion Associated with Auditory Stimulation. Whether the more 
significant causal factor is sound or conflict, it is agreed that “neurotie 
or “emotional” convulsions in the absence of intense sound stimulati?’ 
have been described in only one rat (56), and this single descrip“, 
offers little detail. Further, the proponents of the neurosis ypothes® 
seem to impute “conflict” to any situation which includes inescapar’ 
stimulation of high intensity (c.g., 13, 93). It is therefore logical t 
combine under one heading the descriptions of the seizures attribut? 
by their observers to either variable. i0 
Just how complete a pattern of behavior should be included a$ aude 
genic or neurotic convulsion is open to question. Maier and to 
Morgans (122) originally incorporated the early uplind” running Tai 
their description, probably because of its unusual nature an gel 
oe i ea = ae without a preceding running, E “a 
2 j ollowe is practi t10: 
mediately arises as to the mri aes gi nouga ome oan whe? 
ypica running p@ avid 


it occurs alone or, for that matter, of the so-called premonitory 


& 
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or the substitute activity that so frequently precedes or takes the place 
of running. The criterion of attack will obviously require careful defini- 
tion before a truly satisfactory conception of the problem can evolve; 
numerous controversial issues have remained unsettled largely due to an 
unrecognized disagreement as to terminology. 

: Accepting the most complete sequence as constituting the convulsive 
Seizure, we find almost innumerable combinations of the premonitory, 
running, tonic, clonic, and passive phases. (For detailed descriptions of 
variations, cf. 7, 92, 99, 119, 153, 157.) Classifications have been sug- 
gested for certain combinations of symptoms (99, 157), but the reality 
probably approaches a continuum (or several continua) of severity and 
duration. Any generalized description, such as follows, is thus mislead- 
ingly simple, although possessing the merit of facilitating comparison 
with the convulsions evoked by electricity, drugs, and dietary deficit. 

After the initial startle response to the onset of the stimulus, any of 
a number of patterns may appear. Some animals appear to ignore the 
sound source, others seek to retreat from it, a few actively attack it. 
Many rats crouch motionless or try to burrow beneath the floor covering 
or their fellow subjects. In most nonseizure trials some sort of ‘‘sub- 
stitute behavior” appears—rapid nose or ear rubbing, teeth chattering, 
chewing, shivering, shaking, vibrissae twitching, body cleaning, or rest- 
less head and body movements (2, 88). In trials culminating ina seizure, 
behavior described by Smith (153) as a “motor aura” usually occurs, 
typically consisting of brief quick runs, jerky sidling or backward steps, 
or pivoting movements of the head and body (cf. also 63, 114, 151). 
After a latency ranging between one second and one minute (4, 7, 119, 
153), the active phase begins with a running period, sometimes of 
gradually accelerating speed, sometimes of a very explosive nature, and 
in any instance finally attaining an unusually high velocity. The run- 
ning may or may not follow a circular path, but almost always seems 
undirected with respect to obstacles such as furniture and other rats. 


This phase may be interrupted by one or more quiet periods, may termi- 
Nate suddenly in what resembles coma, or may continue with diminish- 
ing vigor until exhaustion. Most characteristically, it leads directly 
(after an average of 15-20 sec.) into the convulsion proper. This may be 
Predominantly of tonic extension, reminiscent of decerebrate rigidity 
(99, 114), of incoordinated clonic beating of the limbs, or may combine 
both tonic and clonic aspects. During the tonic-clonic phase (average 
total duration 30-40 sec.) the following are sometimes observed: 
Vocalization, exophthalmus, excessive salivation, cyanosis, ejaculation, 
diminished respiration, incontinence, dilated pupils, piloerection, re- 
traction of testes, relaxation of the vaginal opening, and peripheral 
vasodilatation, with death intervening in a very few cases. The rat may 
Continue to exhibit clonic jerks (eg, spasmodic jumping) with decreas- 
Ing frequency as long as stimulation continues, or may lapse into the 
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phase usually described as comatose. Most animals are very unrespon- 
sive to stimulation during this passive phase, with reflexes depressed to 
varying degrees. Spontaneous activity is virtually abolished, and the 
animal is nonresistant to handling. This condition may last for longer 
than 15 minutes, although gradual recovery of spontancous activity 
usually begins within two or three minutes. (The decreased activity 1S 
probably not simply the result of fatigue, for as Beach and Weaver (7) 
point out, the duration of the coma.is often inversely related to the 
amount of exertion occurring in the active phase.) In a few instances 
hypersensitivity or renewed clonic activity succeeds or takes the place 
of the typical stuporous condition (63, 117, 153). A recovering animal 
appears extremely fatigued, and quantitative recording reveals that the 
frequency of movement is subnormal for several hours (42). For some 
hours following a seizure the threshold to induced convulsion is markedly 
elevated, 

Accompanying this pattern of overt behavior, a number of internal 
changes have been detected. Lindsley, Finger and Henry (88) demon- 
strated alterations in the electrocortical activity paralleling the ex- 
ternal manifestations of the attack, and although technical difficulties 
obscured detailed analysis, many features in common with the EEG in 
human convulsive seizures could be recognized. The heart rate also 
reflects the progress of the seizure, fluctuating markedly just before the 
attack, reaching a high level during the active phases of the convulsion, 
and dropping far below normal and showing various irregularities during 
the comatose stage (47, 76, 88, 95, 109, 111, 112). The passive phase is 
characterized also by acidosis, decreased blood COs, and decreased 
clotting time of the blood (22). 

The only attempt to distinguish the pattern occurring in response tO 
sound alone from that resulting from auditory stimulation in a forma 
conflict situation is that of Maier and Glaser (99). They suggest that 
= the former situation the extension component of the convulsion 1S 
relatively more intense and the passive phase less pronounced. How- 
ever, these differences could easily be attributed to uncontrolled vart 
ables, especially age, differentiating the two groups compared, a’ 
Maier and Glaser emphasize that “the similarities are too great to con 
sider these differences as symptoms which distinguish between different 
abnormalities” (99, p. 12). 

Electrically-induced Convulsion. There are at least two different 
forms of seizure resulting from electrical stimulation of the brain, de- 
pending on the intensity of the current. At higher intensities, the onset 
of the current is marked by a “start,” with the spine flexed inward: 
hind legs drawn up, and front legs brought down (125). For the dura 
tion of the shock (conventionally 2 sec. or less) the animal remains rigid 
He may then pass into a state of tonic rigidity, with the back arche?: 
the tail stiff, the legs sometimes briefly flexed (79) and then extende 
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as far down as possible, and the eyes tightly closed. Gradually the fine 
tremors of this stage give way to clonic movements (involving princi- 
pally the hind legs, 48, 155) of decreasing frequency and increasing 
amplitude, with frequent urination, defecation, and sometimes ejacula- 
tion. After about 30 sec. coma supervenes. This varies in character, 
but usually begins with a very deep phase during which the rat is ‘non- 
resistant and the reflexes abolished even more completely than in the 
sound-induced seizure. After a few minutes there often succeeds a 
hyperexcitable stage during which “blind rage” reactions can be ob- 
served. As measured by maze activity, the recovery period is longer 
than in the sound-induced seizure (160). As soon as the animal begins 
to be active, a second convulsion can be elicited, in contrast to the 
longer refractory phase following the audiogenic seizure. 

With less intense shock, the tonic phase may be omitted, with the 
clonic and comatose phases following directly the primary jerk (48). 
The more characteristic pattern, and of particular interest because of 
Its similarity to the sound-induced seizure, includes a vigorous running 
Phase after the initial “start,” which may alternate with or be followed 
by tonic rigidity and/or clonic spasms before the coma begins. Golub 
and Morgan (48), Siegel and Lacey (150), and Stainbrook (155, 157) 
describe this seizure as resembling the audiogenic convulsion closely, 
although the latter observer notes a greater resistance to handling and 
More promptly returning activity in the postseizure state. On the other 
hand, Page (125) believes that the running that is induced electrically 
appears less “forced” than that following auditory stimulation. 

_ Physiological changes accompanying the electroshock convulsion 
include a relative cerebral anoxia, attributed to increased cerebral 
Metabolism (25), and extensive hemorrhages in the pia-arachnoid and 
in the parenchyma (66). : 
_ Drug-induced Convulsion. There are a great many drugs which, in- 
Jected with the proper dosage, produce convulsions in the rat (69). Be- 
Cause of the interest in the insulin and metrazol therapies for human 
ehavior disorders, most of the comparative psychologist’s attention 
has been directed toward these convulsants. For the most part, how- 
ever, the emphasis has been more upon the effects of these drugs oa 
learning and retention than upon the convulsive patterns themselves. 

Berman and Riess (9) injected 1 unit of insulin per 12 g. body weight. 
After about 40 minutes, lethargy and/or coma were observed. In nearly 
50% of the subjects severe convulsions were also observed, involving 
g ied the head region. Recovery appeared complete at the end of 

ours. 

Maier and Sacks (105, 144) give a more detailed analysis of the 


* Cf. the “sham rage” reactions of the decorticate dog and cat (5). It has been sug- 
m that the convulsive seizure may sometimes bring about a sortof temporary func- 
tonal decortication.” 
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metrazol reaction. The tonic spasm is the most characteristic sign of the 
syndrome, but two different patterns distinguish rats minimally and 
maximally susceptible to sound-induced seizures. In the first group; 
intraperitoneal injections are followed in 2-4 min. by a few twitches of 
the head and a sudden tonic flexor spasm involving the head and all 
four extremities, usually with spinal torsion. This is replaced by clonic 
movements of the forepaws, with the combined tonic-clonic period last- 
ing perhaps one minute. The rat then becomes relaxed and passive to 
handling (slow pulse, 47), with recovery apparently complete within a 
few minutes. In the group that responds convulsively to sound, the 
onset of metrazol convulsion is frequently delayed for several minutes. 
The tonic flexor spasm and forepaw clonus are usually present, but these 
stages are followed by a period of rigidity, during which the rat is ex- 
tremely hypersensitive to stimulation. There then usually occurs 2 
period of rigid hopping (as seen in the “neurotic” rats described by 
Maier, 92), sometimes leading to a second period of rigidity and re- 
peated convulsions. In both groups of animals it appears that repeate 
drug injections or, to a lesser extent, sound-induced convulsions, lead to 
an elaboration of the pattern as well as a decreased threshold of con- 
vulsive dosage. r 

Convulsion Accompanying Dietary Deficiency. Most of the studics 
describing convulsions in nutritionally deficient animals have utilize 
diets low in magnesium (53, 85, 124, 136) or certain elements of the 
B-vitamin complex (17, 24, 84, 134). The two patterns, whether pre- 
cipitated by auditory stimulation or occurring “spontaneously,” seem 
too similar to allow any clear differentiation. The patterns both typ" 
cally begin with high speed racing fits, often describing a circular pat a 
and frequently punctuated by vocalization. The rat may then fall es 
his side for a brief period, rigid and with breathing suspended. Ton 
spasms and clonic twitchings then usually occur. Kruse, Orent a 
McCollum (85) report no incontinence in magnesium tetany; Chic® 
El Sadr and Worden (17) observed both urination and vomiting during 
convulsions in rats deprived of Bg, which coincides more closely with a 
descriptions of sound-induced convulsions. Sometimes only the running 
“aspect is observed (17) and sometimes multiple seizures take place (© 
85). Usually a coma follows the active phase, marked by greatly de 
pressed reflexes and, as in audiogenic convulsions, a slow and irreg¥ 
pulse (17, 53). The total pattern, active and passive, usually lasts 7 
minutes, with the gradual recovery coming to the head region n 
Death terminates the seizure more frequently than is typical of t 
other convulsive states. j 

As Patton, Karn and Longenecker (134) point out, these seizure 
can be remarkably similar in outward appearance to sound-induct 
convulsions. To what extent the similarity extends to the physiolog!© 
level has yet to be determined. 
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On the basis of the best descriptions at hand, it would appear that 
the five “types” of convulsion have much in common, as far as outward 
manifestations are concerned. There is no reason to distinguish be- 
tween the convulsions precipitated by sound and those (if any) based 
on sound-induced conflict. While the pattern of response to high in- 
tensity electric current lacks the running phase characteristic of sound- 
or conflict-induced seizure, lower intensities do produce the running- 
tonic-clonic-coma sequence. Although it is not clear that these sequen- 
ces are identical, at least similarities are striking enough to inspire 
comparison at the neurophysiological level. Similarly, the metrazol 
shock includes components suggestively reminiscent of the audiogenic 
Seizure, although lacking the running phase. Finally, the descriptions 
of audiogenic and electrogenic seizures are virtually duplicated by 
those of attacks occurring at certain stages of dietary deficiency. There 
is little evidence in the overt picture to require a differentiation of this 
nutritional syndrome from the seizures produced by auditory or elec- 
trical stimulation in the presumably well-fed rat. 

The external resemblances do not of course establish the community 
of physiological or psychological mechanisms. The results of more 
Precise investigation may invest apparently slight divergencies with 
increased significance. Until then, however, we may well orient our 
experiments around complex patterns of variables which influence 
convulsions in general, rather than around the more limited number of 


etiologically designated categories. 
IV. VARIABLES INFLUENCING CONVULSIVE BEHAVIOR 


The resemblance of the convulsive symptoms induced under the 
several sets of circumstances is not the only hint that a consideration of 
convulsive-behavior-in-general may be fruitful. Several investigations 
make very clear that the appearance of one “kind” of convulsion in a 
given rat may be in part a function of experience with another “kind” 
of convulsion, that what appears to be a major factor in one “kind” isà 
Contributing factor in a second “kind” and vice versa—in short, that 
the interrelationships among the numerous variables are so complex 
that the intercategory lines are crossed too often to allow their strictly 
divisional meanings to be maintained. A few examples will clarify the 
Point, 

1. The amount of metrazol required to produce convulsion, as well as the 
Precise form of the seizure obtained, is a function of the susceptibility of the rat 
to sound-induced seizure (105). Further, both the threshold and form of 
Metrazol convulsion are altered by the experience of sound-induced seizures 
(144). On the other hand, subconvulsive doses of metrazol increase the likeli- 
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hood of sound-induced seizure, even though such attack still possesses the un- 
mistakable characteristics of sound-seizures (78, 103, 106, 145, 154). 

2. The percentage of a population susceptible to audiogenic seizures varies 
with diet (50, 133, 135). While convulsions in nutritionally deficient rats some- 
times appear “spontaneously,” sound stimulation releases them most readily 
(53, 134). 

3. According to some investigators, the convulsive reaction to sound is in- 
creased if conflict based on difficult discrimination is added (99) or if the animals 
have been brought to a condition of chronic emotional disturbance (75). Con- 
versely, sound stimulation has been found to be a peculiarly effective means 0 
inducing conflict that may lead to seizure (99). 


No behavior pattern can be said to be the result of a single causal 
factor. Every response is the product of an interaction between the 
complex environment and an organism whose immediate state has been 
influenced by a multitude of conditions. It is thus a questionable 
procedure to assign the responsibility for any reaction or type of reac- 
tion to one characteristic of the environmental-organismic situation- 
Nor is it invariably possible so to manipulate the relevant variables that 
even the major antecedent can be segregated. In the problem undef 
consideration, for example, it seems that according to one persona 
definition of “conflict” it is virtually impossible ever to provide an 
intense and continuing stimulus without concurrently inducing conflict 
(13). At the same time, the studies of Patton and his collaborators 
(127-136) make it clear that the dietary background of a rat can never 
be ignored; it is difficult to know with certainty that the diet of a pat 
ticular animal is adequate, that with improved nutrition the situation 
presently leading to convulsion might not lose its effectiveness. Like 
wise, it is not easy to limit the exteroceptive stimulation impinging o 
the possibly malnourished subject to a degree that it can be ruled out 
as a precipitant of seizures. For these reasons, as well as for those 4” 
ready stated, it seems defensible at present to avoid dogmatic etiologic? 
classification of convulsions, and simply to outline tha results of varia- 
tion of a number of parameters, hoping that this form of organization 


may lead to new insights regarding the physiological (or psychologic@ 
bases of the behavior. 


Auditory Stimulation 


With the exception of experiments involving the use of electroshoc# 
or drugs, the vast majority of situations producing convulsions in t 
rat have included some form of relatively intense auditory stimulatio™ 
This is true even of certain investigations designed ostensibly to demo?” 
strate the role of conflict (e.g., 92) or dietary deficiency (e.g., 165) in the 
production of seizures. In the early studies it was not clear which phy” 
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cal characteristic of the sound stimulus was primarily relevant. Maier 
and Glaser (99), finding jingling keys a more effective stimulus than the 
tinging of an electric bell, an air blast, or high pitches produced by a 
Galton whistle, even though to the experimenter’s ear it was the least 
loud, concluded that “the degree of effectiveness depends more on the 
quality of the noise than upon the intensity” (99, p. 28). Maier (95) 
adds later that the optimal quality also varies from colony to colony. 
On the other hand, it has been shown that with sound frequency held 
constant, the number of seizures increases with increased intensity, 
while with intensity unchanging a positive relationship obtains between 
sound frequency and number of seizures (121). Carrying the analysis 
further, Morgan and Galambos (120) have demonstrated that the effec- 
tiveness of an auditory stimulus depends not upon its absolute intensity 
level, but upon the sensation level (i.e., decibels above the rat’s thres- 
hold for the frequency involved), and that even low pitches, if of suffi- 
cient loudness, can produce seizures. It is therefore reasonable to 
Presume that the greater effectiveness of the jingling keys was dependent 
upon the presence of overtones of high frequency, inaudible to the hu- 
man ear but falling within the range of maximal sensitivity of the rat 
(cf. 49 for the rat’s audibility curve). It is unfortunate that so few in- 
vestigators have attempted any objective analysis of the sound pattern 
employed, but instead have depended for calibration upon the “experi- 
menter’s ears” (97, p. 285). This reduces the meaning of such state- 
ments as “the water test offered much less auditory stimulation than 
did the key jingling or air hiss tests” (97, p. 285), and vitiates the con- 
clusion that in such instances the auditory component is a negligible 
factor. Similarly, it may be the uncontrolled variation of loudness, 
rather than increased conflict, that accounts for increases in frequency 
of seizure when the subject is tested in a discrimination situation (92) or 
in an enclosed testing chamber (99). It would seem necessary to assume 
for the present that the effect of sound depends predominantly upon its 
Sensation level (for the rat), and that this characteristic must be con- 
walled before less measurable aspects of the situation are considered 
ausal, ‘ 

It must be added that the intensity required to precipitate a seizure 
Varies from individual to individual, and within the same subject from 
time to time. Greenberg and Tufts (53) point out that as the rat be- 
Comes more deficient in magnesium, a less intense auditory stimulus is 
Tequired to set off the convulsion, and others have found similar results 
with subconvulsive injections of metrazol (106). It may be expected 
that a number of factors which influence the reactivity of the receptor 
and nervous systems will vary the effectiveness of a constant auditory 
Tau (cf. many of the following subsections, and 75, 92, 99, 122, 123, 

-136). 

The total energy of the auditory stimulus is not always related to 
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the likelihood of seizure. The number of convulsions is no greater with 
continuous sound than with alternating 1-second periods of sound and 
silence, although with longer cycles the interrupted tone becomes less 
effective (45). It may be significant from the standpoint of threshold 
mechanisms that the latency of attack is related neither to intensity 
(119) nor to temporal pattern of stimulation. 

There has been no experimental demonstration that it is stimulation 
of the cochlear receptors that is a determiner of convulsive behavior, 
Because of the intimate anatomical relation between vestibular and 
auditory receptors, it is not impossible that the vestibular function is 
the more important here. This possibility is perhaps strengthened by 
the finding that swinging the rat during sound stimulation increases 
the number of seizures (75), by the frequent observation of peculiar 
circular or pivoting movements just prior to the attack proper and the 
occasional circular pattern of running, and by the depression of the 
righting reflex in the postconvulsive stage. Then, too, it may be the 
action of cutaneous or pain receptors, particularly in the middle eat 
that is the really significant feature of the “auditory” stimulation. 
There is need for a more clear delimitation of the necessary stimulation 
modality, probably by pharmacological or operative techniques. 


Electric Shock 


Electric current has been used in several types of rat problem, some- 
times as a motivator, sometimes as a punishment or pain-producing 
agent, sometimes to bring about specific physiological effects. he 
behavior evoked depends upon the intensity and duration of the shock: 
the point of application, and the condition of the animal. To y!e 
convulsive response one may, for example, pass a current of 600 v. a” . 
4.5 ma. through the skull for an interval of 2.0 sec. (for other combina 
tions cf. supra, Section II). Such stimulation usually leads to 4 tonic 
clonic seizure of perhaps 30 seconds duration (125), followe 
comatose stage from which recovery gradually takes place over 4 pene 
of several minutes. With a less intense shock, the tonic-clonic phas? E 
‘often preceded by the disorganized running that characterizes h 
sound-induced seizure. l 

Heilbrunn (66) attempted to abolish the electroshock convulsion a 
drug injection just before stimulation. Results were negative with t™ 
following: atropine sulphate; synthetic vitamin K; calcium glucona’ g 
thromboplastic suspension of brain substance, with or without calo ai 
gluconate; synthetic vitamin K and the suspension of brain substan i 
and vitamin K, brain substance, and calcium gluconate. He was 
to diminish the severity of the seizures, or to abolish them comp! 
by light ether anesthetic. 

When the shock is applied to the rat via other pathways, such as 
legs, convulsions do not normally result (6, 10, 15), although ©* 
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fatigue and passivity may supervene with continued stimulation (30) 
and fear and resistance to the situation may accompany repeated 
trials (29). When shock applied to the feet has been used to force the 
Tat to respond in a difficult discrimination situation, convulsions have 
virtually never occurred (10, 56, 92, 123). Only Griffiths (56) has been 
able to produce convulsions in such a situation (one of a group of 10 
Tats), which, it has been suggested by Stainbrook (157), might have 
been actually the result of cerebral electroshock. The fact that mild 
Peripheral shock is one of several stimuli which have been successfully 
applied to evoke convulsions in animals with a variety of dietary defi- 
Clencies (50; but also cf. 135) suggests an alternative explanation of 
Griffiths’ findings. 
Electric shock has been coupled with auditory stimulation in several 
experiments. Page (125) found that the convulsive threshold to cere- 
ral electroshock was not altered when the rats were tested ‘while they 
Were exposed to the sound of air blasts” (p. 186). Maier and Glaser 
(101) electrified the floor of the test chamber every 2 seconds while 
Susceptible rats were being stimulated by the sound of jingling keys. 
The number of responding animals and the total number of seizures 
Were essentially the same as for control periods of sound stimulation 
alone. Morgan and Waldman (123) likewise found that the addition of 
Shock to a sound plus conflict situation had no appreciable effect on the 
number of seizures: The results of Bitterman and Warden (15) seem to 
contradict these observations, for the distraction of the shock decreased 
the number of seizures exhibited by their rats. However, the results of 
their control trials reduce the significance of these findings. 


Other Stimuli 


There has apparently been no demonstration that other extero- 
Ceptive stimuli are capable of eliciting convulsive behavior, except in 
Conjunction with certain factors of an atypical nature. Unusual sensi- 
tivity to noise is reported by Chick, El Sadr, and Worden (17) in rats 
reared on a dict low in vitamin Beg: “the slightest external stimulus 
Seemed to precipitate” seizures quite similar to audiogenic seizures 
(p. 579). In some of these rats immunity to noise developed, but 
Seizures could still be induced by stroking, by pricking the tail, or by 
Placing the animal in an unusual position. Similar observations are 
Teported by Patton, Karn and Longenecker (134). J. R. Hamilton (65) 

escribes convulsions lacking the running component that have ap- 
Peared over a period of several years in one laboratory strain of rats. 
Olses may precipitate the attacks, but tactile stimulation is most 
effective. On the other hand, stimulation of an intense nature inhibits 
€se seizures. 

Other forms of stimulation have been found to affect the capacity of 

Sound to induce seizures. Humphrey and Marcuse (75, 76) present 
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evidence that swinging or rotating the animal during sound stimulation 
may increase its susceptibility to seizure, and Maier and Feldman (97) 
have demonstrated that a water spray has a similar effect. In the 
opposite direction, immersion in warm or cold water may reduce the 
number of sound-induced convulsions (101), although the significance 
of this experiment is obscured by concomitant exercise of a strenuous 
nature, Stimulation by bright light is also said to inhibit seizures (92). 
Whether the modifying effect of these several stimuli is mediated by the 
production of changed emotional status or operates in a more direct 
fashion cannot“be decided on the basis of the experimental data now 
available. 
Age Differences 


With increasing age, a number of vital physiological alterations take 
place in the organism. Which of these changes modify the responsive- 
ness of the rat to sound-induced seizures is not known,” yet there is n0 
question but that the degree of susceptibility is in part a function © 
age (27, 31, 39, 102, 115). The number of responding rats in a colony 
seems to increase during the first months of life, reaching a maximum 
at about 135-150 days and declining thereafter. There are, howeve? 
individual exceptions to this trend, with some rats (maintained on 
standard colony diet) first exhibiting convulsive behavior at 250 days 
or later. Although a decrease in susceptibility is the normal accompa 
ment of aging beyond about 150 days,* some animals continue to demon 
strate the full pattern at 500-600 days. The question of the relation 
between severity of attack and age is not clearly settled. In young ë 
rats the seizures seem to be principally of the running type, with th 
more severe convulsions appearing more frequently with age incremen 
(34, 130). Smith (153), however, reports more deaths during seizur® È 
the younger subjects. According to Maier and Glaser (99), rats i 
seven months of age have fewer attacks of the ‘‘extensor type” than 
younger rats. t 

The age of the rat at the time when withdrawal from or supplere” g 
ito the diet is made is of importance in influencing the occurrence, g- 
sound-induced or “spontaneous” seizures (cf. Nutritional Factors |, 
low). A diet lacking in pyridoxine may fail to produce convulsions wa 
mother, while the offspring will exhibit severe disorders during lop” 
nursing period (134). Too, it appears likely that the earlier in ever i 


1 Alterations in the rate of metabolic processes, the changing endocrine palance: ate 
number of intestinal flora capable of synthesizing vitamin B (134), the atypical heart at 
development of the rat (111), and the variations in auditory sensitivity might erele 

8 Farris and Yeakel (33) trace a decline in the susceptibility of albinos begin”! n to 
early as 60 days. The exact time relations will undoubtedly vary from investigati? net” 
investigation, depending on such uncontrolled factors as diet, strain of rat, mode 0 t the 
ing, etc. For example, the same experimenters (34) find that in gray Norway rats 
percentage of attacks increases up to the age of 80-100 days. 
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ment a supplement of vitamin B-complex concentrate is given, the more 
complete the protection against sound-induced seizure (128). 
Inasmuch as susceptibility has been shown to vary markedly with 
age, and in a highly individual manner, it is essential that particular 
effort be made to control this factor in investigations attempting to 
ascertain the effect of other variables upon seizures, especially where a 
wide age range is to be covered. £ 


Sex Differences 


There is little clear-cut evidence concerning sex differences in sus- 
ceptibility to conyulsion. Morgan (119) found no reliable difference in 
the average latency of audiogenic seizure between the two sexes, and no 
Statistically significant differences in over-all incidence has been re- 
Ported (e.g., 7). But in spite of the lack of conclusive data in any single 
Study, the trend of all available figures is in the same direction—slightly 
More males than females respond convulsively during auditory stimula- 
tion. Unpublished data of the writer show 18.7% of 268 unselected 
males and 11.6% of 112 females susceptible over a series of four tests, 
Maier and Glaser (102) report 55.7% vs. 52.3%, and Farris and Yeakel 
(34) and Finger (39) found differences of a similar order. Farris and 
Yeakel (33) suggest that males may react more frequently early in life, 
With responses of females predominating later. Although females 
might logically be expected to be more variable in their response because 
of their periodic physiological changes, it is impossible without further 
experimental data to assign greater total susceptibility to either sex. 


Differences in Genetic Background 


Strain differences. A number of workers have noticed differences in 
Convulsive responsiveness among rats of different strains. It seems quite 
Clear that domesticated gray Norway rats are much different from al- 
binos in their response to sound (31, 115). In the first place, their startle 
is appreciably greater (36). Then, the number of animals susceptible is 
Significantly higher, in one instance exceeding 95% while a comparable 
group of albinos yielded less than 70% of susceptibles (34). The average 
Percentage of trials per rat giving seizures is higher for the gray Norways, 
the increase with age ‘continuing longer and the subsequent decrease 
Postponed several weeks. Finally, more seizures among the grays, both 
relatively and absolutely, include the convulsive phase, as contrasted 
to the predominance of running among the albinos tested with the same 
auditory stimulus. 

The results with wild gray rats are decidedly in contrast. Griffiths 
(58) was not able to induce a single seizure in 126 wild Norway and 15 
wild Alexandrine rats, using air blast, jingling keys, and Galton 
whistle successively as sources of sound. Unless these negative results 
Can be attributed to an insufficient intensity of stimulation, we must 
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explain the difference between the responses of the wild and domesti- 
cated groups in terms of such uncontrolled factors as age, dietary back- 
ground, and emotional state. 

Differences in inherited “emotionality.” A second genetic difference 
influencing susceptibility to sound relates to Hall’s concept of emotional- 
ity (61). His primary measure of emotionality is the amount of defeca- 
tion and urination displayed by the rat when placed in an unfamiliar 
open field of 8-foot diameter. The tendency to react in this emotional 
fashion seems to be inherited (62), and rats bred from a so-called emo- 
tional strain display fewer convulsive attacks in response to sound than 
do rats from a nonemotional strain (11, 113, 114; cf. also infra, p. 223). 

Differences in familial susceptibility. The greater part of the evidence 
points to the conclusion that, with other factors held constant, rats 
with susceptible parents are more susceptible, on the average, than are 
offspring of nonsusceptibles. For example, by selective breeding over 4 
number of generations, one strain can be produced with a relatively 
higher degree of susceptibility, a second with a much smaller percentage 
of convulsions (57). The precise mechanism of genetic transmission 15 
still in doubt, however. Dice (27) and Watson (168) concluded that an 
“epileptic” pattern in mice, apparently somewhat similar to the sound- 
induced seizure in rats, was transmitted as a Mendelian recessive trait. 
Maier and Glaser’s early study (100) concluded that the susceptibility 
to convulsion behaved asa simple Mendelian dominant, but the data 0 
Maier’s later experiment “do not support a single Mendelian type ° 
transmission” (95, p. 331). The entire hereditary background of a 
particular rat must be considered in predicting his susceptibility : 68.670 
of five litters with nonsusceptible parents and grandparents, but from 

n “unstable strain,” were found to convulse to sound (95; cf. also 57)- 
The source of much of the confusion concerning this matter arises from 
the neglect of other relevant factors, such as age. As has clearly been 
shown, the diagnosis of “nonsusceptible” made on the basis of test 
over a limited span of life is misleading, for a rat failing to respond at the 
age of 20 weeks may respond for the first time at 40 weeks (39). Geri 
4ainly the question of genetic transmission will not be settled unt} 
more investigations are made, with systematic control of the other 
pertinent variables. 


A little information is available indicating that convulsive respons? 
to metrazol is related to the inherited susceptibility to sound alone (93); 
This may be further evidence of the unity of the several “kinds fa 
convulsion in the rat; additional study may reveal the inheritance o 
constitution with a certain level of general convulsion-tendency. PA 

Differences in “nervous tone.” In one group of rats having exper” 
enced a number of audiogenic seizures, the systolic blood pressure ke: 
found to be abnormally high (115). Inasmuch as the majority of at 
animals were from a single litter, the experimenters hesitated to asc™ 


te 
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the hypertension to the occurrence of seizures. Rather, “audiogenic 

seizures may be simply a measure of that factor in the animal’s constitu- 
‘ > «oy 

tion (‘nervous tone’) which causes the hypertension” (p. 303). 


Differences in ‘‘Emotionality” 


As indicated above, rats from a strain bred for ‘‘emotionality” (as 
measured by response in the open field test) are less susceptible to con- 
vulsion when exposed to sound stimulation than are rats from a non- 
emotional strain. When the rats themselves are tested for emotionality 
in the open field, a higher percentage of those classified as nonemotional 
are convulsers, the correlation between the two patterns of behavior being 
—.S7 (11, 113). It has been suggested that the chronically emotional 
animals have an habitual “safety valve” which allows the constant 
dissipation of tension rather than its building up to the threshold of 
Convulsion (114). Requiring a somewhat different interpretation is the 
discovery that “sentled” rats have fewer sound-induced seizures than 


do untamed rats (76, 110). 
Conflict and Chronic Disturbance 


It would be difficult indeed to define “conflict” to the satisfaction of 
all psychologists and in such a manner that there would be unanimity 
as to the conflict value of a particular situation. Eschewing this task, I 
Shall simply quote the word as it is used in certain experimental reports; 
some readers will desire to insert the modifier “alleged” at several points. 
The controversial literature may shed additional light on the semantics 
of the situation (e.g., 13, 40, 41, 56, 92, 96, 118, 123). 

It has already been noted that a few investigators impute conflict to 
every situation in which an intense auditory stimulus evokes convulsive 
behavior (e.g., 13). From this standpoint, the material reviewed above 
under Auditory Stimulation becomes immediately relevant, with the 
substitution of “conflict” for “sound.” Beyond that, there are a number 
of examples in which a factor termed “conflict” has been examined as 
to its effect upon convulsive behavior. It would be very convenient for 
the student of conflict, incidentally, if sound-induced seizures were 
demonstrably independent of conflict. In such an instance, the facilitat- 
ing influence of deliberately provoked conflict upon a subthreshold 
Convulsive tendency (with a standard auditory stimulus) could become 
a measure of the degree of the conflict itself. The sound-stimulated rat 
Would then become an assay object for conflict just as it might for an 
Inhibiting or facilitating drug. And similarly for several others of the 
factors considered in Section IV. 

As has been pointed out, Griffiths (56) describes convulsions in a rat 

orced by an electric current to choose between two negative cards in a 
Lashley jumping apparatus. The dominant factor as explained by the 
€xperimenter is stimulation of the emotional centers, with “Fear an 
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escape . . . the basic principles underlying the seizures” (p. 29). The 
proponents of the conflict hypothesis on the other hand could describe 
the crucial factor as being conflict between the necessity of reacting and 
the unavailability of an appropriate response.* No other example of 
convulsion resulting from non-auditory conflict is in the literature. 

Conflict brought about by the presentation of a difficult discrimina- 
tion problem has several times been reported at least to increase the 
likelihood of seizure in response to sound. Maier and his collaborators 
(92, 99, 103) found that a greater number of seizures result when the 
animal is faced with a one-window situation in the Lashley jumping 
apparatus and at the same time stimulated by an air blast than when 
stimulated by air blast alone. The greater number of seizures is at- 
tributed to the addition of the conflict based on the jumping problem- 
(On the other hand, it is interesting to note that in the conflict-plus- 
sound situation the percentage of convulsions, contrasted to simple 
running attacks, may be smaller than in response to sound alone. 
Using a small number of rats in a similar experiment, Morgan an 
Waldman (123) obtained negative results. 

Other features of the testing situation are said to add to the conflict 
and thereby increase the number of seizures in response to sound. Erect- 
ing walls on the testing platform produces more convulsive seizures, an 
enclosing the subject on all six sides leads to an additional increment 
(99). Whether the greater effectiveness of this arrangement is due tO 
alterations in the auditory component or in the conflict aspect is sti 
debated. When a rat stimulated by sound is simultaneously stimulate 
by a powerful water spray, more convulsions occur than when the non- 
auditory effect of the spray is lacking (97). These results have been in~ 
terpreted as meaning that conflict between the need to escape the 
annoying spray and the inability to do so increases susceptibility to 
convulsion. That the facilitating effect of the water depends’ on more 
than its cutaneous stimulation value is yet to be established. 

Two other experiments may be interpreted as supporting the con” 
tention that the incidence of sound-induced convulsions can be increas 
by conflict. Bernhardt, Tobin and Signori (10) were unable to produc? 
convulsions in a difficult Lashley discrimination situation even thoug 
response was forced by electric shock. When rats exposed to this pro 4 
lem were subsequently tested with a loud bell, however, they exhib! 
more seizures than did a control group lacking discrimination training: 
Humphrey and Marcuse (75, 76) produced “chronic disturbances 
rats by noisily moving the bottomless feeding box after completio? 
maze trials. The animals so treated convulsed more readily in respo? 
to noise than did a control group. Small numbers and incomplete ©° 3 
trols diminish the significance of these experiments, however. 


9 Alternative explanations are mentioned above. 
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Anger and Fear Reactions 


Tseng (164) reports that the induction of a prolonged anger state by 
teasing increases the likelihood of a rat’s convulsing upon exposure to 
loud sound. In the same laboratory, Arnold (2) observed that on trials 
in which “fear” reactions (tremor, twitching, lip-wetting) occur there 
Is decreased probability of seizure. That injections of adrenin reduce 
Susceptibility is presumably related to this finding; the adrenin induces 
the fear symptoms, which becomes a substitute for convulsion. 


Other Behavioral Relationships 


Substitute activities. If under “convulsion” is subsumed only the 
running-tonic-clonic pattern, then an inverse relation obtains between 
the appearance of convulsions and certain other forms of response to 
Sound. In nonsusceptible rats, or on the nonseizure trials of suscep- 
tible rats, the onset of the auditory stimulus is frequently followed by 
face rubbing, ear scratching, body cleaning, sniffing, or chattering 
(2, 56, 88, 97, 151). Inasmuch as seizure virtually never ensues if such 
forms of behavior occur, they may be classified as substitute forms of 
response—in one sense, a means of protection against convulsion. ` 

Other forms of alternate activity have been described as charac- 
terizing non-convulsers: ‘“haphazard’’ movements (4), increased motor 
activity such as climbing (10), exploration (2), attack of the sound 
Source (58), general ‘‘nervousness’’ (87, 92). Bitterman and Warden 

) point out that adding mild shock to sound stimulation not only 
Produces more activity in the rat than results from sound stimulation 
alone, but also is followed by fewer convulsive attacks. It has been 
suggested that such forms of behavior, as well as chronic emotionality 
(11), fear reactions (2, 3), and forced exercise (76, 101), drain off the ex- 
Citation provoked by intense stimulation before it can build up to an 
explosive level (161). Yet to be discovered are the factors which pre- 

ispose an animal to these responses rather than to seizure, 

Escape reactions. When the rat is forced (e.g., by an air blast) to 
make difficult discriminations in the Lashley apparatus, the well de- 
veloped response is often disrupted by the appearance of position 
habits and inaccurate (“abortive”) jumping. In the former, no attempt 
1s made to select the correct stimulus card; instead, the jump is always 
to the same side. In the abortive reaction, the animal gradually loses 
his skill in reaching the window, hitting above, below, to one side, or 
even falling short of the food platform. These responses appear par- 
Ucularly when the negative stimulus is presented, and ordinarily can be 
eliminated by retraining. Maier (93) and Maier and Klee (103) report 
that the number of seizures produced by conflict plus sound diminishes 
when an animal develops either of these modes of jumping. The ex- 
Planation suggested is that these reactions offer a means of escape from 
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the problem situation. An animal for whom the situation no longer 
involves choice (fixated position habit) or for whom punishment for 
incorrect choice is decreased (abortive jumping) experiences less intense 
conflict, and so, granting the facilitating role of conflict or emotional 
upset, would be expected to exhibit fewer convulsions. 

The possibility of escape of another sort was provided in a sound- 
stimulation experiment lacking the discrimination problem (56). In 
this situation, the rat was exposed to the loud sound of a bell and during 
the running phase of the seizure was given the opportunity to run into 
a small “rat-hole.”’ It is reported that of the rats entering this shelter, 
fewer continued into the convulsive stage of the seizure than had done 
so in trials prior to the introduction of this feature. The results are 
interpreted to mean that the rat’s “affective tone” is a factor in deter- 
mining convulsive behavior, and that the ‘psychological shelter” 
response makes the badly frightened rat feel more secure. Evaluation 
of the observations is difficult without further information regarding 
the control of such factors as sound intensity and adaptation. 

Activity and restraint. Numerous observations have been made 
which relate the probability of sound-induced seizure to the amount 0 
gross movement allowed or required during or just preceding the 
stimulation. If the animal is forced to move vigorously during this 
time, the convulsive responses decrease in number (76, 101) or are 
inhibited completely (94). On the other hand, placing the rat in a 
relatively small cage, so that movements tend to be restricted, is Te- 
ported to increase the percentage of seizures (99, 101), although the 
meaning of these data is somewhat obscure because of small numbers 
and limited controls. This trend apparently breaks down upon see 
creasing the degree of restraint still further, for binding the subject 
during auditory stimulation reduces (72, 111) or eliminates (56) the aP? 
pearance of seizures, as well as certain physiological manifestations ° 
abnormal response (88). It may be that while the relative restraint of a 
small cage reduces neuromuscular activity, the restraint of close binding 
actually increases neuromuscular activity in the form of isometrC 
contractions against the bonds, thus providing as much of a “safety 
valve” as does forced exercise or substitute behavior. It would be 
enlightening to determine if habituation to such restraint leads to the 
reappearance of convulsive symptoms. 


Nutritional Factors 


The precise influence of nutritional factors upon convulsions ine 
duced by electroshock, sound stimulation, or conflict is difficult te 
determine by the average psychologist because of his lack of technic? 
knowledge concerning the laboratory diet. It is often assumed th 
Purina chow, perhaps supplemented occasionally by a ration of spinac i 
is adequate for maintaining the rat at his physiological optimum, a” 
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at for thi 3 
relevant = ee the dietary factor can be disregarded as a variable 
formidable — behavior. There is accumulating, however, a 
completeness: > on the effect of either extreme of dietary 
Stituents ate ani vulsions are very readily induced when certain con- 
with the provisi nimal, and become increasingly difficult to provoke 
extremes a. of generous supplements. As the influence of the 
intermediate pe A obvious, one is led to suspect that in the 
tritional stat con ition—that of “standard” colony feeding—the nu- 
more deid is an omnipresent determiner. It would appear that 
eyond studie —— of this factor is justified, extending even 
tivation, s of convulsions to such problems as learning and mo- 
Ma ; f 
rats OF seg ire Some fifteen years ago it was reported (85, 124) that 
Wardly ou en dict frequently exhibited convulsions ‘out- 
genic,” Furs to the seizures later described as ‘‘neurotic’’ or “audio- 
amplified our D experiments (50, 51, 52, 53, 135, 136, 165) have 
convulsive re knowledge of this phenomenon. If a group of rats, free of 
includes onl noni even to intense sound, are put on a diet which 
and body J 5 mg. of magnesium per 100 g. of dry food, the plasma 
lays, and hierar levels drop, vasodilatation is observed for a few 
rats will axhihie state of hyperexcitability begins. In this condition, all 
Sound (but gi it the running-tonic-clonic pattern when stimulated by 
Stage the re ae when such stimuli as shock or light are used). At this 
in most of | sap aa of magnesium in the diet alleviates the symptoms 
iet (en, 0.4 animals (136). When smaller amounts are included in the 
Nduced oor mg. per 100 g. of food), the convulsive behavior may be 
Occur in the ab, is somewhat more severe in form, eventually comes to 
the additi absence of any demonstrable stimulus, cannot be cured by 
With 5 on of magnesium salts, and often terminates in death (53). 
è unt or more of magnesium per 100 g- of food, some rats exhibit 
When cal sions only under special conditions, ¢.g., when lactating or 
alcium is added to the diet. Supplements of vitamin G delay the 


aPpea 
oe of the syndrome (53). 
asan ta In 1938, Kline, Tolle and Nelson (84), employing the 
Maintained T object for vitamin Bı, described a polyneuritis 1n rats 
early stage oh 25-50 days on a diet deficient in this vitamin. In the 
s “a state of extreme contraction of the musculature” was 


i 


Dsery, 
ed; i P > 
trolled mei “the animal . . . [exhibited] almost continuous uncon- 
ements” (p. 309). A 2-unit administration of the vitamin 
More closely 


hin 24-48 hours. 
tern was that observed in 


ts deprived of vitamin 


Caused 

resembling symptoms to disappear wit 

19 by ae sound-induced convulsive pat 
le El Sadr and Worden (17) in ra a 

US seemed zures often occurred daily, and “the slightest external stimu- 

addition of to precipitate them” (p. 597). Cure was effected by the 

of Bs in adequate amounts. A number of later studies (19, 24, 
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59, 127, 132, 133) have verified that the lack of normal B-vitamin 
content in the diet brings about susceptibility to sound-induced con- 
vulsions, and that in extreme cases ‘“‘spontancous’’!® seizures occur 
that are indistinguishable in form from the audiogenic seizure. Con- 
versely, almost complete protection against audiogenic seizure is af- 
forded by the inclusion in the diet of large amounts of vitamin B, although 
the supplement is maximally effective only if provided early in life. 
There are a number of complications that make difficult an exact 
statement of the role of the vitamin B complex in this behavior. Hun- 
ger (38, 121) and inanition (81, 132) themselves affect susceptibility 
to sound-induced seizure (each function passing through a maximum), 
and a subnormal vitamin level usually decreases food intake. Further: 
the effects of inanition and vitamin deficiency may cancel each other 
out when they are present simultaneously (131). Multiple vitamin 
deficiencies may similarly counteract each other, and supplementary 
feeding with other vitamins at times increases the severity of attack in 
Be-deficient animals (134). Finally, restoration of food after a period of 
fasting leads first to an increase and then to a decrease of seizures (132): 
But certainly it appears that the dietary factor is an extremely impot- 
tant one in the determination of convulsions, and may account for the 
differences in susceptibility accompanying differences in age, strain, 
and colony background. The importance of this variable in unsuspecté 
cases is enhanced by the observation that a deprivation which has 9° 
effect upon adult females may still induce susceptibility in their suck- 
ling young (24, 128, 134). ’ d 
Other constituents. Other nutritional factors have been investigate 
for their effect upon response to auditory stimulation. Rats maintaine 
on a vitamin D-deficient diet for the first four weeks of life were wc! 
more susceptible to sound than were a group given the standard labo 
ratory diet (153). Increments in frequency of convulsive attack may 
result from parathyroidectomy and low calcium intake (50). : 
Although the mechanisms by which nutritional factors modify 
convulsive susceptibility are not yet known, it is clear that their 19° 
fluence upon the rat’s physiological condition must constantly 
1° Whether or not an auditory stimulus is essential as a precipitant of convulsions 4 
the malnourished rat is not clear. Patton, Karn and King (133) observed that the one 
of seizures is often delayed until 1-5 min. after termination of the auditory stimu ne 
which may account for the “spontaneity” of some cases. Only the deafening of va 
exhibiting allegedly spontaneous seizures can rule out this type of etiology, or strength 
the unique position of the auditory modality in the production of seizures. It ane 
relevant to note that Galambos and Davis (44) found about half the fibers in the nate 
auditory nerve firing “spontaneously” even when precautions had been taken to eli a8 
inate ambient noise. Incidentally, the overt manifestation of seizures in the discrimi" 
tion situation has occasionally been observed to have its onset several -seconds after 
termination of the auditory stimulus (93). 
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considered by the investigator of seizure behavior. The manipulation 
of this variable gives promise of yielding results of broad significance. 


Drugs 


Allied to the modification of physiological condition through control 
of diet is the injection of drugs as a means of altering seizure suscep- 
tibility, There are a number of drugs, notably metrazol and insulin, 
which have been used as the precipitant of convulsions in the rat (68, 
80, 141, 142, 174; cf. also Section II). The pattern of this response 
differs from that involving intense sound or low intensity electric 
shock chiefly in its lack of a running phase and, with metrazol, the pro- 
minence of its characteristic spinal torsion. 

Metrazol, when given in subconvulsive doses, so affects the rat that 
an auditory stimulus otherwise ineffective becomes adequate to evoke a 
Prompt seizure (56, 78, 106, 145, 154). Such an attack includes a 
running component, and in this way resembles the usual sound-induced 
Pattern more closely than it does the metrazol convulsion, although its 
Severity is often greater (78). The tonic flexion typical of the drug 
Convulsion is observed only occasionally with this combination (106). 
Other drugs successfully utilized to lower resistance to sound include 
Strychnine (2, 153, 154), caffeine sodium benzoate (154), sodium phos- 
phate (50), coramine (166), eserine in adequate dosage (72, 73), nicotine 
(72, 74), and perhaps sodium bromide administered prenatally (64). 

A few substances reduce susceptibility to sound-induced seizures: 
adrenin (2, 87), dilantin sodium (21, 54, 148), adrenal cortex (54), 
atropine (72, 73), and small doses of eserine (74). Rats rendered sus- 
ceptible to sound stimulation by strychnine injections are protected 
rom convulsion by adrenin (1), but the simultaneous use of atropine 
and eserine does not yield a similar neutralization (74). Semiparalyzing 

Sage of curare appears to abolish the susceptibility of rats on normal 
ie rations (88), but those maintained on a low-magnesium 
S can still exhibit seizures under the influence of subparalytic 
eto of curare (53), as do thyro-parathyroidectomized rats on a 
an calcium diet (50). Phenobarbital (77), mecholyl, benzyl benzoate, , 
stim Pituitrin (74) fail to affect significantly the response to auditory 

ulation, 
adre Onvulsive threshold to cerebral electroshock is uninfluenced by 
o nin (125), atropine, calcium gluconate, thromboplastic suspension 
RE substance, and vitamin K (66). The susceptibility is perhaps 
haa slightly by 1.0 cc. injections of alcohol (125), and light ether 
os netic eliminates the convulsions (66). Animals rendered sensitive 

RS by magnesium deprivation are protected temporarily by ie 
Prob amytal, while the midbrain drug picrotoxin perhaps enhances the 

ability of seizure (53). 


n outlining the convulsant properties of drugs, it is always essential 
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to add that the “typical” effect of a particular drug can be altered by a 
great number of characteristics of the organism and of the external 
situation (69). These additional factors must be taken into account by 
the investigator adopting the pharmacological approach to convulsive 
behavior. 


Previous Auditory Stimulation and Convulsion 


It is seldom possible to elicit the full convulsive pattern by renewed 
stimulation directly following a seizure induced by auditory excitation. 
Certain exceptions to this trend appear, and in some cases of severe 
dietary deficiency a series of repetitive convulsions resembling status 
epilepticus may be observed. (As has already been pointed out, no suc 
refractory period follows the electroshock convulsion.) In general, 
however, 1t appears that immediate susceptibility is greatly reduced by 
convulsion, and even by auditory stimulation unaccompanied by sei- 
zure (101). But the various investigations of this adaptation process F 
it extends over longer periods of time, involving repeated trials, yiel 
divergent results. Several experimenters (11, 64, 76, 153) find that wit® 


: s 
fiths (56) found that repeated stimulation actually made the animal 
less resistant to convulsion, as indicated by a shortened latency a” 


ia r as returned to its former level, 
vuisive response may fail to appear. This presumption is confir 


the observation that animals regularly responding at 24-hour inter 
fail to respond when the trials are given at shorter intervals (38). lus 
course, even when the recovery is incomplete, a more intense stim y 
may nevertheless induce seizure. Itis this possibility that explains W z 
in one testing situation adaptation takes place while in another it 1§ n 

evident; the rats in Patton and Karn’s and Parker's experiment ne 
have been stimulated more intensely than were those animals dems 
strating adaptation, and Maier and Glaser’s key jingling may ha io 
provided more effective (high frequency) stimulation than did their # 
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blast. In summary, the probability of a rat’s response depends not only 
aot such factors as his genetic constitution, age, emotional state, and 
ietary background, but also upon the interval since the last testing, 
ire intensity of the auditory stimulus, and his individual rate of re- 
ery. Why it is that threshold should vary with experience is a chal- 
enging problem for the neurophysiologist. 


Neural Ablation 


wee of the most obvious techniques by which the nature of con- 

the eT be studied is operation on the nervous system, The role of 

ininen itory receptors, the participation of the cerebral cortex, the 

t Coreta of lower brain centers, autonomic involvement—these are 

aa Open to surgical investigation. Unfortunately, technical 

appr les exist which have tempered the enthusiastic adoption of this 
Proach, and as a consequence little information is available. 


em airal lesion. Maier’s original report (92) included three cases in which 
spite ars e responses to difficult discrimination plus air blast were observed in 
and oe a cortical extirpation (15-20% of the frontal cortex in two rats, 
results of the median region in the third). According to Smith (153), certain 
Villa Suggest that removal of the frontal pole of the cortex abolishes the oe 
Wit . seizures” (p. 326). Two experiments have attacked the problem directly 
tible sigh results. Beach and Weaver (7) subjected the three most suscep- 
than dues group of 12 rats to two operations during which was removed more 
hippaa A of the neopallium, as well as most of the corpus callosum, much of the 
ter eo and parts of the corpus striatum. When tested about two weeks 
trasted ne decorticates responded on each of 12 tests to jingling keys, as con- 
with 91% of 42 preoperative tests. Unfortunately, the control animals 
a increased in susceptibility during the interval, responding 79% of the 
Bogs compared to the previous 36% positive trials. In each group the pro. 
ör a of seizures which included the tonic-clonic phase increased. W einer an 
tinge (169), on the other hand, found that while the percentage ee in 
5 no rgical control group did not change significantly, there wasa T in 
Sufferi mber of seizures exhibited by the 20 experimental animals. Among t yo 
in then! Prefrontal lobectomy, the number of responses dropped from 22 to Di 
ose Se with lesions in the auditory area, there was a reduction from 41 to 4; _ 
is gj With motor area lesions had 10 as contrasted with a previous 30 attacks. It 
the gnificant, however, that only one seizure in the prefrontal group included 
ther nic-clonic component of the convulsion, while all seizures appearing 1n the 
Operated groups were of the full convulsive type. No change in average 
Y of seizure was found, in contrast to the marked decrease reported by 
nd Weaver, : ; 
imi. reconcile these divergent results seems impossible. The differences in 
Period. and strain of rats, original level of susceptibility, operative recovery 
Perim and extent of destruction qualify any direct comparison of the two ex- 
ents. Further definitive work will be necessary. 
Systen omy The possible mediation of convulsions byt 
Suggests operative limitation of its function. It seems on 


he autonomic nervous 
the basis of 
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preliminary data that vagotomy decreases (72) or abolishes (88) the appearance 
of seizures. 


Other Factors 


There are a number of minor factors which apparently contribute to 
individual differences in susceptibility. Some are subject to experimen- 
tal manipulation, and others can be controlled only statistically. A full 
explanation of convulsive behavior must take them into account. 


Blood composition. The relationship between plasma magnesium level and 
convulsion has already been noted (51, 165). Lacey (86) found that the serum 
protein level of rats responding convulsively to the buzzer and air blast was 
higher (significant at the 4% level) than that of nonsusceptibles and suggeste 
neural dehydration as the mediating mechanism (cf. 38). Further, the susceP” 
tible group was characterized by the greater variability in blood sugar, perhaps 
relating to a dysfunction of the sympathetico-adrenal system. 

Endocrine action. Little work has been done relating the endocrine balance tO 
seizure threshold. Removal of the thyroid and parathyroid glands, coupled wit 
low calcium diet, tends to increase susceptibility to stimulation-convulsion 3 
No effect of oestrous cycle was observed by Mirsky, Elgart and Aring (11 ) 
which suggests lack of pituitary-ovarian influence. : 

Heart rate. In a group of rats selected for their convulsive response to air 
blast, Lindsley, Finger and Henry (88) found that in trials culminating in con- 
vulsion the initial heart rate averaged 379.4 per minute, increasing 20.1 b 
per minute after the onset of stimulation, while in nonseizure trials the! 
rate was 398.2 and the average increase only 4.4. These intertrial differe” 
were not statistically reliable, however. 

Startle response and general activity. Turner (167) found no di! 
tween susceptibles and nonsusceptibles with respect to diurnal activity P4 
The startle response of the susceptibles was clearly the greater, however, 4 
the amount of their activity during brief periods of moderate noise. arn- 

Learning ability. In an exploratory study of the relationship betwee” le na 
ing ability and seizure susceptibility, Sisk (152) gave 35 rats 14 daily trials pall 
15-unit T-maze and then tested their convulsive reaction to the sound ot a 
The six responding positively to the noise had exhibited an average ma 
formance slightly inferior to that of the nonsusceptibles, but the difference ” 
too small to be more than suggestive. . the 

Physical condition. Farris and Yeakel (34), finding that rats tested 1° he 
spring and summer reacted more violently to sound than did those teste ond” 
winter, have suggested that the significant variable is general physical č and 
tion: the more susceptible rats were from litters born in late fall and winteñ 
typically less sturdy. (Cf. also 95, 131, and Nutritional Factors above): pody 

Body temperature. There seems to be a tendency for the lowering ° ound 
temperature to decrease (97, 101) or abolish (94) seizures in response t? other 
although the experimental demonstrations are somewhat complicate 
factors. E 

Combinations. In an attempt to establish physiological patterns distin’ use 
ing between rats responding convulsively to noise and nonsusceptibles, Ma 
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and Moore (110) compared measures of their retiring behavior, sensitization, 
Weight, frequency of defecation, and cardiac action. By using a composite 
criterion including at least two of these indices, “...a group was selected 


ae comprised 70% of the susceptibles and 18% of the non-susceptibles . . . " 


V. Tue Errect OF CONVULSIONS 


Because of the possible applications to problems of human shock 
therapy and epilepsy, a number of observations have been made of the 
effect of convulsions upon other behavior patterns and the structure of 
the rat. Most of the specifically planned experiments have been con- 
cerned with learning and retention, but incidental observations are 
recorded regarding several other points. 


Learning Ability 


bec few experiments in which the effect of convulsions upon learning 
‘a8 been investigated point generally to a slight decrement in acquisi- 
tion ability, especially as measured by speed of response (16, 71, 141, 
149), However, the differences are so slight, the scope of the research 
So limited, and the operation of distorting factors so probable (e.g., 
Motivation, 6, and emotional state, 149), that this conclusion can be 
Presented only very tentatively. The results obtained in future experi- 
ments will undoubtedly depend upon such variables as the kind of 
hima situation employed, the age and physiological condition of 

z rats, the number and spacing of the seizures, the diffculty of the 
nig problem, and the interval between the convulsion series and 
he learning test. 


Retention 


Ti disrupting effect of convulsion upon material already acquired 
hay een subjected to somewhat closer scrutiny. Four investigators 
ee employed loud sound to induce the convulsions. Three experi- 
s (55, 151, 160) revealed no appreciable loss of a maze habit after 
time cote, although there was in some cases a tendency for the 
no of running to increase unless some form of forcing was adopted. 
oe i experiment (108) rats that had had 1-5 (?) seizures in the food 
id a the maze made significantly more errors on relearning trials than 
Cat S animals stimulated similarly but having had no seizure. The 
ria menter, however, felt that the deterioration in maze performance 
a Sa product of the superposition of fear on a learned pattern, rather 

n a loss of a learned pattern” (p. 34). Taken together, these results 
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suggest that although there may be a change in general behavior as the 
result of a limited number of sound-induced seizures, there need be 2° 
disorganization of recently learned maze habits. Of course, there is 2° 
reason to assume similar stability of response following a longer series 
of convulsions. x 
The deleterious effect upon retention of even one or two convulsions 
induced by metrazol or insulin seems quite definite. In only one inve 

tigation (68) did the convulsed animals respond as well upon retest a 
did the control group, and the significance of this single finding is limite 
by the small number of convulsions involved (average of two)- m 
other investigations involving maze behavior the seizures did lea 1l 
deterioration of the habit, particularly so with more difficult, less We 
learned, and more recently acquired responses (9, 89, 141). Insulin 2 p 
metrazol convulsions have also been shown capable of effecting the i 
covery of an extinguished conditioned response (46, 80). On thee 
hand, there is no indication that either a fixation in the Lashley jump? 
apparatus or the well-learned discrimination habit itself is affecte 
metrazol convulsion (82, 103). . ijarlý 

The results of experiments employing electroshock are smi ae 

positive under certain conditions. It appears that very simple ha a 
(e.g., a running response, a two-choice T-maze) may not be impair jal 
at least as measured by the number of errors on the retention a 

(149, 160). However, on more difficult mazes and in one discriminat pe 
situation, retention scores have been lower, and certainly 5° š t 
number of shocks is increased (29, 71, 156). In at least one instanc? al 
deterioration of the pattern seemed to be due not just to empi till 
factors, but to a “direct effect on cerebral tissue” (29, p. 278). ; d 
another experiment, an extinguished conditioned response reap 
after asingle electrically-induced convulsion (80). 

‘ As in the series of studies involving learning ability, these e 
tions of retention are too limited in range to be conclusive. It apP ith 
that, given enough convulsions, there is measurable interference? ded: 
certain habits. An extension of these experiments is sorely ot ie 
with variation of difficulty and age of the pattern, number and pi vals 
of practice trials, number and type of convulsion, retention mes 
and method of retesting. And as Stainbrook (158) has intimate “hap? 
always essential to differentiate between actual loss of habit pen an 
indicated by the criterion of errors or number of relearning trials) by 
the generalized effect of emotion or lack of alertness (as indicat? esti 
time measures). The obvious corollary of these studies js the io ral 
gation of the neurophysiological changes paralleling the behav 
modifications. 
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Reasoning Ability 


ene P, Winder and Stone (146) studied the effect of electroshock 
Sie A E ability as measured by Maier’s apparatus (90) which 
ele . ~, bringing together spontaneously two elements of past 
(91 7 without having them previously associated by contiguity” 
bt am ). The tests were given 14 or 22 hours after electroshock 
differen on. i Although only six subjects were used and individual 
distans ces were apparent, the trend was in the direction of decreased 
aa ein exploratory runs, increased time for test runs, and decreased 
Shen 7 scores. The impairment was greater 13 hours after shock than 
e after 22 hours, indicating that at least part of the dis- 
dpe was ofa transitory nature, similar in this respect to the effect 
8g in retention studies. 


Emotional Patterns 


Mesa course of experiments testing learning and retention following 
other a some observations have been made of changes occurring in 

Stainb ehavior patterns, particularly those described as emotional. 
try (156, 160), for example, reports that after a series of elec- 
manifested a, experimental neurosis” appeared, during which the rats 
Upset b e a and resistance when approached, and appeared 
Page hoes ight noises. The condition persisted for several months. 
experime ) found that while electrically convulsed animals resisted the 
e T during the first few days, they soon became passive and 
other n na two experiments, one employing shock (149) and the 
Which t ast (108), the rats seemed trying to avoid the situation in 
disturban, convulsions had been experienced, and evidenced emotional 
is the a when placed in the situation again. Directly contradictory 
Slectrogh Servation of Duncan (29) that animals having undergone 30 
a convulsions evinced no fear in the experimental situation, 
“Mployin hae to rats shocked on the legs. Most experimenters 
ance of a aad sound as the convulsive agent similarly report no avoid- 
bis € situation by the rat (e.g., 92). aie 
known dy of the available data makes it clear that too little is 
n the aa the generalized behavioral changes following convulsion 
liberates It is desirable that future experiments be oriented more 
y and systematically in this direction, with objective meas- 


Tes a 
OPted to reveal the possible changes. 
Activity 


No 
very refined technique is necessary to 


aCtivit detect the decrease in 
Y immediately following convulsion. T 


his relative im mobility 
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varies in degree and duration from seizure to seizure, often retarding 
maze running for a period of several hours. Only three attempts at 
quantitative evaluation of activity changes have been reported. In the 
one involving sound-induced seizures (42) it was found that auditory 
stimulation without convulsion did not affect subsequent activity, but 
following a seizure the activity level dropped significantly. Recovery 
was apparently complete within ten hours, with no cumulative effect 
discernible after three seizures. Unfortunately, the number of attacks 
induced was too few to support any generalization regarding long-range 
modification of diurnal pattern. 

It appears that a single electroshock likewise depresses activity 
measurably, and that during a period of daily convulsions the activity 
level drops significantly below normal (162, 170). The effect is not 
long-lasting, however; some recovery is evident in the first few days 
following termination of the series, and the preconvulsion level is ap- 
proached within 2-4 weeks. 


Mating and Maternal Behavior 


Some of the early investigators reported difficulty in breeding rats 
shown to be susceptible to sound-induced convulsion (e.g., 100). This 
might seem to imply that the convulsions themselves exert a depressing 
influence upon mating. The most direct test of this supposition is a” 
experiment by Farris and Yeakel (32, 35). They found that animals 
subjected to air blasting five days a week beginning at weaning produce 
30% more litters than did control litter mates not subjected to ps 
sound. On the other hand, significantly fewer offspring of these €% 
perimentals than of the controls survived the first day of life, due in par 
to a noticeable inefficiency of maternal behavior among the experimen” 
tal mothers (cf. 57). Unfortunately, no breakdown of figures is given 
that would permit a separation of the influence of the auditory stimu 
lation itself from that of the convulsions produced thereby. 


Physiological Changes; Death 
rom the 


To the extent that behavior changes are found to result f 
phys!° = 


experiencing of convulsions, parallel changes in the structure or phy 
ogy of the organism might be expected. In general, no very St"! prs 
alteration has been found, perhaps because relatively little attenti? 
has been paid the problem. 

Rats (as well as other laboratory animals) undergoing re 
convulsions, induced either electrically or by loud sound, gra : 
deteriorate in general physical condition and gain less weight than 
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peate 
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control animals (42, 125). It is possible that the systolic blood pressure 
becomes abnormally high (115). According to one exploratory study, 
exposure to sound, with or without seizure, causes the deposition of 
Pigment granules in the adherent mucus of the stomach walls, but 
with no evidence of ulceration (104). Electroshock convulsion (and not 
the shock itself) produces hemorrhages in the pia-arachnoid and the 
parenchyma, presumably because of circulatory changes accompanying 
the convulsion (66). Finally, studies of the blood indicate that, fol- 
lowing audiogenic seizure, “the gray Norway rat rapidly developed a 
shorter blood coagulation time, and an acidosis caused by acids other 
than carbonic in the blood” (22, p. 525). It will be noted that most, if 
not all of these physiological changes are of a temporary nature. Direct 
evidence of permanent modifications, e.g., in the central nervous sys- 


- tem, is lacking. 


The probability of death in convulsion apparently depends for the 
most part upon the age and physiological condition of the subject. 
tdinarily, very few deaths occur during sound-induced seizure, al- 


though up to 4% mortality has been recorded among very young 


animals (4, 153). In convulsions associated with nutritional deficiency, 


death seldom occurs if the deficit is moderate (17), but in extreme cases 
of magnesium deprivation death during convulsion, within 30 days, is 


the Virtually invariable outcome (53, 85). 


VI. PuystoLocicaL Bases Or CONVULSIONS 


When the psychologist is puzzled by the appearance of an atypical 
Pattern of behavior, he often looks to the physiology laboratory for an 
answer, Or at least he remains uneasy until an answer to his “why 
can be phrased in physiological language. An imposing array of be- 
avioral observations confronts the student of rat convulsions, but the 
‘Nterrelationships of the numerous variables are so complex (and so 
requently ignored experimentally) that a coherent picture is still , 
acking. Unable to systematize his behavioral findings at the gross 
level, the investigator searches for a description at the neurophysiolog- 
‘cal level, to which the diverse external manifestations may be referred. 

n short, the first five sections might be considered a mere preliminary 
© this section, which should be the climax of the review. J 
nfortunately, however, no thoroughgoing theory of convulsion 
aS been attempted, nor shall I essay one here. In reality, the prime 
Purpose of this report has been to present the problem in such a way 
at the reader with an adequate background will undertake his Ta 
Ormulation. It may be of value to mention again some findings tha 


ow 
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seem particularly deserving of attention in such a theory, and to add a 
few conjectures that have been made regarding them. But all this is 
intended only to hint at one direction which speculation may take, and 
is certainly too tentative to be a restraining influence on the flow of 
hypothesis. Even the contradiction of experimental data need be no 
final deterrent, for a great deal of the work demands repetition with 
more satisfactory controls. 

It would appear that in the convulsions attributed to conflict, 
auditory stimulation, minimal electroshock, and nutritional deficit, the 
neural discharge finds a final path that is approximately common. 
only the tonic-clonic stage were considered, this statement could also 
include certain drug-induced seizures. That this intense discharge is 
central in origin is suggested by several lines of evidence. 


1. Electroencephalographic recording before and during audiogenic seizure 
reveals cortical involvement (88). 

2. Cortical extirpation may decrease the probability of seizure to sound 
(169). d 

3. The mode of interaction of the convulsion-facilitating eserine and the 1n- 
hibitory atropine indicates that their influence is through the central nervous 
system (73, 74): 

4. Coramine, which increases the reactivity of the central nervous system, 
in subconvulsive doses increases convulsive susceptibility to sound (166). 


But there is also reason to believe that subcortical and autonomic 
centers contribute to the innervation. 


1. Cortical extirpation does not always abolish seizures (7). acu te he 

2. Although the total number of seizures may be diminished by lesions int is 
motor cortex, the proportion of attacks which include the tonic-clonic phase 4 
thereby increased. Weiner and Morgan (169) suggest that the normal func? 
of this region tends to inhibit the more incoordinated (convulsive) discharg 
which has a subcortical origin. f 

3. According to Morgan (119) and Weiner and Morgan (169) latency p 
seizure is uninfluenced by a number of factors, including cortical extirpation: 
which affect threshold of seizure. This timing process may thus be a subcortic® 
function. Relevant to this point is Smith's suggestion (153) of a physiolog!® 
timing mechanism, cyclic in character. 

4. Changes in heart rate during auditory stimulation are prognostic 
ceptibility during that trial. Both these changes and the susceptibility t° $ 
are minimized by vagotomy (88). i nd 

5. In metrazol and electroshock convulsion the autonomic system 1$ te 
to be stimulated, with the action of the sympathetico-adrenal system p 
dominating (37, 80). 


of sus” 
eizure 
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The puzzle is: Why do the central overloading and the convulsiy e 


synchrony of central firing occur? A number of possibilities may 
mentioned. 
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1. Relative cerebral anoxia seems to accompany convulsion induced by 
electroshock and certain drugs (25, 79). Such a condition may well alter the cell 


eee surfaces so that the tissue becomes abnormally sensitive to stimula- 
10n. 


2. In states of malnutrition a parallel sensitization might take place, so that 
normally ineffective stimuli would serve to cause mass response (cf. 136). 

k 3. In instances where these factors are usually absent, auditory stimulation 
might induce a temporary anoxia. Whether the mechanism involved should be 
termed reflex or emotional cannot be stated now. 

4, Due to the proximity of the auditory and the motor centers, direct spread 
of excitation may be uniquely facilitated when this modality of stimulation is 
employed. 

5. Hypothalamic centers may be excited rather directly, discharging both 
Centrally and peripherally, via the autonomic nervous system (88). In the in- 
tact organism reverberation from the visceral organs may supplement the origi- 
nal cortical innervation, leading to explosive overflow unless constantly drained 
off by emotional (2, 114) or substitute (56, 76, 88) reactions. 


It is obvious that these suggestions are little more than isolated 
Phrases in the physiological interpretation that finally must be made. 
More work, particularly involving electrical, pharmacological, and 
Operative techniques, must be planned with neurophysiological hypoth- 
eses as the guiding orientation, at the same time recognizing the 
multiplicity of factors that must be segregated and controlled. It 
appears that breaking down the classificatory distinctions between at 
least some of the “types” of convulsion would facilitate this approach. 
What the final categories will be, and their relation to human paroxys- 
mal behavior, must wait on a more complete experimental integration. 
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A CRITICAL REVIEW OF THE LITERATURE ON 
“ABSOLUTE PITCH” 


D. M. NEU 
Indiana University 


Pitch discrimination and, more specifically, “absolute pitch” have 
been a source of controversy and speculation for a great many years, ` 
beginning with Stumpf’s discussion of Mozart's sense of pitch (43). 
In general, absolute pitch is the ability of an individual to name cor- 
rectly a particular tone that is sung or played on an instrument, 
without comparing it to any other heard tone. The explanations of 
absolute pitch have been varied and many. The inadequacy of these 
explanations creates our present problem. Even more so than other 
forms of abilities, musical talent, especially absolute pitch, has tradi- 
tionally been explained as an inborn quality or faculty that relatively 
few people are privileged to have. 

In view of the fact that an inborn quality has been accepted so 
generally as an explanation of accurate pitch discrimination, one might 
hesitate to offer any other explanation if it were not for the fact that no 
real evidence has been presented to prove that there is such a quality. 
Moreover, if all the work on pitch discrimination is reviewed, while 
keeping in mind the real event taking place in pitch discrimination, the 
evidence points strongly away from any faculty or quality inherent in 
the individual. Rather, the evidence might show that the ability to 
discriminate pitch depends upon the life conditions of the individual and 
IS probably built up during his lifetime. 

It is the purpose of this paper to criticize the deftnitions, theories, 
and experiments relating to absolute pitch, from an interbehavioral 
Standpoint (16). This point of view stresses the behavioral event taking 
Place when an organism is interacting with a stimulus object in any 
Particular situation. The stimulus value that the object has for the 
individual depends upon the individual’s life history, the situation at 
the time, as well as upon previous interaction with the object. The 
response that the individual makes depends upon the stimulus value 
or function that the object has for him in any particular situation. 


DEFINITION OF ABSOLUTE PITCH 


The definition of the term “absolute pitch” has caused the greatest 
Confusion in evaluating data from experimental work, because of the 


Mahy different meanings of the term. As a result each investigator 
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criticizes every other investigator’s work which yields results different 
from his own, by saying that the other investigator’s subjects did not 
have absolute pitch or that the conditions allowed the subjects to use 
relative pitch, or any number of other similar reasons. 

Petran (30) describes absolute pitch as follows: 


... on the psychological side is an inherent assumption of the extensity and 
telephone theories. With this assumption in mind, judgments of absolute pitch 
may be defined as judgments based on association learned between more or less 
narrowly limited ranges of the pitch series, . . . these judgments being without 
reference to or aid from any tone or tones recently heard which have been given 
asa standard or attended to in any degree as being of a certain pitch or familiar 
pitch position (30, p. 12). 


Lewis (24) defines pitch as an attribute of auditory experience and 
not as a characteristic of acoustic waves. 


Pitch is that attribute of auditory experience which determines the positions 
of sounds in a psychological continuum extending from low to high .. . Pitch ts 
no more a property of sound waves than color is a property of light waves: 
Pitch arises as an attribute of auditory experience when the ear is stimulated in 
appropriate ways, just as color arises as an attribute of visual experience when 
the eye is appropriately stimulated (24, p. 121). 


The performance in absolute pitch is defined by most writers as 
designating a heard tone correctly from memory alone, without any 
other aid. In the opinion of the present writer, absolute pitch might be 

better defined as the ability to discriminate tones without the aid © 
_ other tones to such a degree that naming or pointing out the note 16 
rarely incorrect. 

Bachem (3) probably has made the most complete attempt Oy 
categorize ways of discriminating pitch. Genuine absolute pitch 15 
based upon immediate recognition of the chroma, which would be the 
same definition most investigators would give to it. Quasi-a solute 
pitch, he says, is based upon aural or vocal standard an 
sense. Pseudo-absolute pitch is based upon the estimat 
height. Bachem’s types of pitch would probably be agree! 
most writers. He has only given names to the various degrees 
discrimination. However, it is only too obvious from the literature © 
many investigators have taken what Bachem calls quasi- Or P le 
pitch in judging genuine absolute pitch. For this reason a very relia 
collection of data from absolute pitch experiments cannot be made. fe 

Bachem says that the outstanding difference between the absolue 
pitch of most investigators and his type of genuine absolute pitch 3s i 

accuracy of judgment. The average error of judgment in cases 
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“pseudo-absolute” pitch amounts to about five half-tones (after train- 
ing) and in “genuine-absolute” pitch it is one-eleventh of a half-tone 
according to Abraham, and even less according to other authors. We 
would agree with Bachem and others that absolute pitch discrimination 
is a matter of accuracy of judgment. This evidence furthers all the 
more the explanation that absolute pitch is merely discriminatory 
ability developed to a very fine degree. Ability here does not imply 
some special gift which the individual has inherited, but a development 
of auditory discrimination to the greatest accuracy during the lifetime 
of the individual, based upon his reactional biography. Thus, absolute 
Pitch is only the most accurate degree of pitch discrimination. 

Although Seashore is probably the foremost proponent today of 
absolute pitch as a mysterious gift or power, he unknowingly substan- 
tiates our view by stating: 

As to the degree of accuracy of free tonal memory, we finda gradual transition 
from the very finest absolute pitch through all degrees of accuracy down to 
crude forms of acquired pitch. But it is convenient to make the distinction that 
absolute pitch may be thought of as involving accuracy to a very small fraction 
of a semitone; whereas acquired pitch is usually thought of in no finer terms 
than semitones (37, p. 18). 


THEORIES Or ABSOLUTE PITCH 


Practically all explanations of absolute pitch assume an inborn 
quality of some sort, and in some degree, although the trend in recent 
years has been to regard the ability as an acquisition. Probably the 
best known theories of absolute pitch are the so-called physiological 
theories. However, we shall not consider physiological theories in this 
paper for we strongly maintain that the stimulus object and situational 
factors involved must be considered, as well as the individual, in any 
explanation of behavior. The above does not mean that we are ignoring 
the important work that has been done by such men as Stevens (40, 
41, 42), Thurlow (44), Wever (48), Wever and Bray (49, 50, 51), 
Wilkinson and Gray (53), and others upon differential “place” respon- 
ses in the cochlea, but we maintain that even if such differential phy- 
Siological reactions take place, they do not adequately explain pitch 
discrimination because they do not represent the entire behavioral 
event which occurs. 

Physiological and physical data must not be confused with psy- 
chological behavior. Since we are interested in the psychological or 
behavioral event, that is, pitch discrimination, it is necessary for us to 
State what this psychological event consists of and thereby point out 
the futility of most of these theories, and their inability ever to give @ 
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satisfactory explanation. In the act of discriminating a “pitch” we 
are interested in the particular individual determining a tone. That is, 
the tone is the stimulus and the factors that operate in the interaction 
between the individual and the tone are our problem. Itis clear then 
that the theories based only upon physiological or physical data are not 
considering the actual psychological event, but rather only a part of it, 
the medium of contact. When we use the term, media of contact, we 
are speaking of the means by which the individual is able to react to the 
object. For example, in the case of pitch the individual is able to react 
to a tone because a medium of waves is present and they enable the 
individual to make contact with the sound of the stimulus object and to 
respond to it. The sound waves, then, are not the stimulus object but 
only the medium of contact between the individual and the stimulus 
object. 

Practically all views of absolute pitch involve one of two explana- 
tions: 


1. The individual may possess a special ability, differing from anything 
sessed by the average individual. 
2. The individual may possess a high degree of some ability which is found 


to some extent in all persons. 


Boggs (8), Révész (33), and Baird (5) are exponents of the first 
view. They believe that quality is the distinguishing feature of a tone 
in absolute pitch judgments. Boggs was at first inclined to believe that 
ability to hear overtones was the cause of keen pitch discrimination, but 
later changed her view from timbre to quality. Quality, she says» = 
ables a person to hear tones in much the same way as those with c° p 
vision see colors. Baird’s view is similar. He bases it chiefl 
errors and on immediateness of judgment, like the recog” 
when once it has been pointed out as red. Révész makes 4 
distinction between those who recognize tones by pitch, and those 
recognize them by quality. Both are supposed to have genuine absolu a 
ear but the latter group has judgments more immediate an PA 
correct. Révész holds that ability to distinguish quality is innate, eF 
colors in vision, and cannot be acquired. Pitch-memory, 0” he ne 
hand, may be acquired and perfected by diligent practice. T° wor 
the influence of timbre, he says it is possible that when other parts new 
tonal complex are changed the quality cannot be recognized 10 the 
combination. 

For von Kries (20), absolute pitch is something which cann ote 
acquired, while Stumpf (43) leaves the question open and give aye 
weight to familiarity, musical ability, feeling, and interest aS 
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in practical musical activities. Mull says, “The most convincing proof 
of the spuriousness of tonality, in Baird’s sense of the term, would be 
success in training average individuals in absolute pitch to the point 
where they are similar to the ‘gifted’ individuals in every respect” 
(28, p. 475). 

Seashore says, “Absolute pitch is an inborn predisposition which 
manifests itself during childhood and is strikingly immediate and 
spontaneous, . . . ” (35, p. 18). He believes that absolute pitch involves 
errors of a small fraction of a semitone, whereas acquired pitch operates 
in terms of steps in the musical scale. Seashore considers that both are 
forms of memory and that both rest upon a favorable hereditary capac- 
ity, although both are influenced by training. But absolute pitch is 
influenced only to a slight extent, while acquired pitch can be learned to 
some degree by any person with a good sense of pitch and it is dependent 
upon knowledge of musical steps and notes. ~ 

Explanations of absolute pitch which do not rest upon a special 
ability are held by Meyer (27), Köhler (18), Watt (45), and Ogden (29). 

Meyer (27) says that if we are to assume that some people are 
gifted with absolute pitch, it means that there is a physiological proper- 
ty, the lack of which prevents the acquiring of this mental faculty. It 
would be difficult to say what the physiological property would be. 
Rather than to make such an assumption, Meyer favors the idea that 
everyone potentially has the ability. He says that everyone has a 
certain amount of memory of pitch, since they are able to recognize 
and discriminate. Meyer also states that there is no reason proved by 
any experiment which shows that the individual by systematic and 
sufficiently lasting practice may not be trained to better ‘‘absolute 
pitch.” He considers some features of the tone-body, rather than pitch, 
to be the basis of absolute pitch judgments. 

The theory that tone-body, and not pitch, is important has been 
developed by Köhler (18). Tone-body includes brightness, vocality, , 
volume, and intensity. It is recognized immediately without the inter- 
vention of a memory image. Watt (45) disagrees with Kohler’s list of 
attributes and considers that Kéhler has merely shown that timbre is 
important. Watt believes the attributes of tone to be two, pitch and 
volume. Both are taken into account by absolute ear, where there may 
be either a special refinement of sensory apparatus or a better auditory 
Memory. But he thinks that in many cases of absolute pitch there is 
required also an absolute point of reference in auditory pitches. The 
t of Jocalization of the skin, where, however, 


Problem is analogous to tha 
does not exist. It is due to the lack of 


the great influence of relativity 
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relativity that tactual localization is possible for all persons. Auditory 
localization is possible only to a few who have not allowed the relative 


ear to dominate the absolute ear. According to this view, we allhave | 


had a chance to possess absolute pitch, but because of our education in 
musical relationships we have lost it through disuse. 

Ogden (29) bases absolute ear upon a refined sensitivity to different 
clang-patterns and also insists upon the necessity of reference to a 
scale with which the individual who possesses the talent is familiar. 
This explanation is essentially like that of Watts. With Ogden, how- 
ever, the point of reference seems to be introduced for purely systematic 
reasons and to be considerably overemphasized, especially when one 
remembers that good interval-judgment is not a requisite for absolute 
pitch. 

Some investigators think that movement of the muscles in the 
throat are a help in making a judgment of pitch. Lotze (26) and Koh- 
ler (18) thought there must be a “larynx sensation ” and thus pitch 
discrimination was limited to those with singing voices. Ladd an 
Woodworth (22) thought an innervation of the larynx is present when 
one is judging a note. Gough (13) in an extensive experiment cou 
find no evidence of this. Wyatt makes this statement: 

. . . it seems probable that proficiency in a pitch discriminatio 
affected by many factors which are quite remote from simple auditory 
and which may involve, not just the auditory sensorium, but possibly ev 
entire organism (58, p. 54). 


Lewis has progressed farther than most investigators i 
the pitch discrimination of an individual to that particular 
experience. However, he is holding to traditional ideas 10r ne then 
“The sense of pitch, an innate capacity... (23, P: 349). But ical 
again, he also says: “Pitch is not a hard and fast thing like 4 et 
quantity. To regard pitch as something rigid and fixed is to xen oint 
the basic meaning of tonal experience” (23, p. 349). Lewis’ mai" f the 
is that pitch is not to be confused with frequency, whic most © 
older writers have made one and same (30). i gat 

Bachem (4) thinks inheritance, attention, and experience a more 
factors in the creation of absolute pitch. His factors presen the biO” 
objective approach, especially if by inheritance is meant only the 
logical make-up of a human being. Attention is important sor t? 
individual must attend to or contact the stimulus object 57 ii life 
interact with it, and experience is the reactional biography peha 
history that is more important in determining the indivi ual’s 


ior (16). 
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Even though the interactional history with the tone stimulus may 
be subtle, it is clear that the reactions of the individual would have 
better opportunity to be highly developed if interaction with the tone 
were deliberately undertaken. We mean by this statement that the 
discrimination of pitch, even when it is developed to a point where it is 
“absolute,” is developed during the lifetime of the individual. Absolute 
pitch is a matter of the keen development of attending to the stimulus 
(the tone) and the keen development of perceiving the stimulus (differ- 
entiating the tones). 

If we are to consider that pitch discrimination is developed not from 
some inborn power but during the lifetime of the individual’s reactions, 
it is not important whether the investigator elicits pseudo, quasi, or 
genuine absolute pitch in his experiment. It is not important because, 
from this interbehavioral view, the individual's accuracy of discrimi- 
nation reflects only the degree to which he has interacted with tones 
during his lifetime. The individual with discrimination of the kind 
that we have chosen to call ‘‘genuine’’ absolute pitch has obtained a 
psychological interaction with different tones equivalent to what most 
of us have with the various colors. This does not mean necessarily that 
the individual has to “train” himself or to practice deliberately with 
tones in order to judge them. The interaction with the tone stimulus 
could be as subtle as any psychological interaction can be; for example, 
as subtle as how we learn to distinguish red from green. 

That there are usually great differences between those who “have” 
absolute pitch and those who are “trained” along that line is not the 
crucial point which most writers seem to think. The point that needs to 
be emphasized is the uselessness of merely attributing absolute pitch to 
some unknown power. If we are to find a scientific explanation, we 
must deal with the actual event of the tone and the organism discrimi- 
nating it. With an interbehavioral explanation we can deal with the 
facts as they are present in the crude event (16), viz:'the discriminat- 
ing of pitch so accurately that it is “absolute” and the learning of 
this ability during the lifetime of the individual. This means that we 
will have to consider the individual's reaction from the beginning of his 
Psychological interaction with tones of any sort. How the person will 
develop his reactions to tones depends upon the individual in his life 
conditions. Therefore, the individual who cannot discriminate pitch 
after training as well as the individual with absolute pitch but without 
Special training is not able to overcome the effects of reaction to tones 
that developed during his lifetime before training was started. We 
know that the first years mark the beginning of basic permanent behav- 
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ior, and so the older the person the more difficult it becomes for him to 
be influenced by training (16). 


INVESTIGATIONS OF ABSOLUTE PITCH 


Some main points in the results of experiments that Petran (30) 
reviewed up to 1932 were that white notes on the piano are more often 
correctly judged than black; that notes in the middle range are judged 
more accurately than those in the extreme range; and that there is a 
tendency to judge high notes lower and low notes higher than they 
actually are. In a recent experiment Riker (35) found that subjects with 
musical experience were able to judge pitch better in the middle range. 
All of these results show better judgment with notes that in general are 
used most often. In other words, notes that are played more often and 
experienced more often by the individual are the ones he knows best. 

Most investigators did attempt to consider experience as an expla- 
nation, for one of the most frequent explanations offered was that the 
more familiar a timbre, the easier it was to judge. However, von Kries 
(20) and Abraham (2) both rejected this explanation. Stumpf (43) and 
Köhler (18) both concluded that experience must be considered. 
Chiloff (10) concluded that absolute pitch depends on the faculty of 
determining the timbre of sounds of different heights. Baird (5) found 
subjects trained in music to be less accurate with tones on other instru- 
ments than on a piano. Köhler (18) holds that unfamiliar timbre 
recognition is quite impossible. Gough (13) after training both musical 
and unmusical subjects found only comparatively slight differences 1” 
accuracy of judgment with different timbres. Thus, from these resu A 
it seems that familiarity of timbre is important in judgment of pite F 
Further results pointing to familiarity are the experiments by Stump 
(43) who found that a bass player was better in judging tones in the 
lower range than in the rest of the scale and that a violinist was better 
in judgment in the upper range. 

“Association with tones. Association with tones was re 
many observers such as Stumpf (43), Whipple (52), Boggs ( 
Gough (13). One was a keyboard association so that the subjec 
spatial position of the note. Another was an association with note 
prominent song or triad built on the tone. A non-musical assoc} A 
was with pain, such as with high notes, or tones sounding W00 y is 
muffled to the subject. All these results suggest that past experience 
necessary before the associations with which to make judgme 
occur. 

Practice effects. Practically all of those individual 
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pitch are musicians of more than average competency (28). Bartholo- 
mew (6) found that proficiency of subjects increased with the number 
of ear training and harmony courses they had. Jadassohn before 1899 
stated that “the faculty of absolute pitch was easily acquired” (15, p. 
33). The first experiment on acquiring absolute pitch was made by 
Meyer (27) on himself and another person. They learned to name 
tuning forks from 100 to 4500 vibrations per minute by their vibration 
numbers, and thirty-nine notes on the piano, with a rather high degree 
of accuracy. They began with a few pitches at a time and added new 
ones from time to time. They did this with tuning forks and a piano. 
With a choice of 39 pitches, more than half of their judgments were 
correct. After several years the subjects lost the greater part of what 
they learned for want of continued practice. Boggs (8), experimenting 
on herself, improved her judgment by paying special attention to 
Overtones; and Köhler (17) did the same thing by concentrating on 
what he calls ‘“body-tones.” 

Both Gough (13) and Mull (27), experimenting with college stu- 
dents, found that a certain amount of pitch naming can be acquired by 
musical and unmusical subjects. The two experiments show that 
systematic practice in pitch identification is effective. 

The results of Mull’s experiment (28) show that ability to judge 
notes correctly can be greatly improved by training. The improvement 
is largely immediate and is relatively lasting in its effects. Mull states 
that in making judgment of absolute pitch a high degree of attention to 
the notes is more effective than a lower degree, and a note which is not 
attended to has no noticeable effect. She also finds that the ability to 
acquire absolute pitch correlates better with practical musical ability 
than with abìlity in the Seashore tests. Mull concludes that the average 
Person can acquire absolute pitch. She bases this statement on the fact 
that there is a similarity of performance between Baird's “gifted” sub- 
jects and Mull’s group of average persons after extensive training. 
Mull goes on to state that the possession of absolute ear seems to rest 
upon an interest in the notes themselves rather than in their melodic or 
harmonic relationships. The importance of attention is indicated by (a) 
the immediateness and permanence of the learning, (b) the individual 
variability from day to day, (c) the fact that a stimulus which is not 
attended to has no noticeable effect, and (d) the fact that a good hearing 
of the note is necessary before judgment can be made. 

But, as Mull says, “There still remains the question why ce 
individuals without having been trained possess absolute ear so much 
More completely than others.” Mull’s answer to this question presents 
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one possibility for the development of absolute pitch and is certainly 
more feasible than attributing it to an inborn quality. She says: “An 
answer may lie in the suggestion that such persons, probably when they 
were children, found notes interesting in themselves... ’’ (28, p. 492). 

The percentage of correctness among “gifted” persons in Baird’s 
study (7) ranged from 100 per cent to 0 per cent, both by group and by 
individuals. Boggs (8) percentage of correct judgments in his “gifted” 
subjects ranged from 100 per cent to 16 per cent in the same individual. 
Mull (28) thinks it depends upon experimental conditions and upon the 
individual himself. 

Most investigators found the middle pitches to yield the greatest 
precision in judgment, but this was not always the case. Stumpf (43) rec- 
ognized that the greatest precision is in the middle region and that 
pitch discrimination is finest there. But he attributes it to greater use 
and consequent familiarity, for we have already mentioned his subjects, 
the bass player and the violinist, who discriminated best at the lower 
and higher ranges, respectively. 

Riemann (34) said that some musicians have absolute pitch because 
of their frequent tuning of their instruments. Wilson (55) agrees with 
this. Wyatt (58), in one of the most recent experiments, showed tha 
pitch discrimination of initially “pitch deficient” adults was sign” 
cantly improved after intensive training. She places special emphasis 0? 
the need for the right kind of training in order to get improvement 1m 
pitch discrimination. is 

Another argument for the sufficiency of an explanation on the sine 
of previous experience and familiarity is the fact that many inves 
gators have found some order of difficulty among various 
Von Kries (20) finds general agreement among musicians t 
from least to most difficult is: piano, strings, wind instruments, 
whistling, tuning forks, and bells. Baird's order (18) is piano, 

-voice, and tuning forks. Mull (28) points out that it is possib 
this could be due to familiarity, but Baird (7) and von Kries (20) d dy 
care to believe it. The fact that Baird and von Kries have a slighi 
different order shows that it depends upon the particular individu 
experience as to what order he would choose. A piano player ™ ther 
more likely to judge tones correctly on any musical instrumen 
than on a tuning fork or bells with which he would have little cO 
More specifically, a violinist may be able to judge tones mor cel str 
than a pianist because it is necessary for him to play tones on his a as 

ment by placing his finger in the correct position; but a pia 
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merely to touch a key which hits a string already tuned to a particular 
tone. Bachem (4) substantiates this by his statements: 


Musical experience, however, plays a role in the acuity of absolute pitch. ... 
Several (subjects) believed that musical experience improved their quality of 
absolute pitch, and lack of experience allowed it to decline. . . . Two musicians 
found their absolute pitch declining with increasing deafness, which had not 
progressed sufficiently, however, to interfere with their musical activities. ... 
A somewhat greater acuity of absolute pitch was found in violinists than in 
Piano players (4, p. 439). 


_ Cameron (9) conducted an experiment to see the effects of practice 
in discrimination and singing of tones. He found no marked correlation 
between the initial capacities of the subjects tested for discrimination 
of tones and ability to reproduce these tones accurately by singing. 
Practice in singing tones of a certain pitch resulted in marked reduction 
in the error of reproducing those tones in the case of four of six subjects. 
Slight improvement in singing tones of a pitch different from the one 
practiced was made by three subjects and no improvement in the other. 
Subjects who improved in accuracy of singing tones of a certain pitch 
improved also in discrimination of tones at that level. There was no 
improvement for subjects in the discrimination of tones of a different 
pitch from that practiced. Subjects who did not improve in accuracy of 
singing made no improvement in discrimination. 

Lewis (25) conducted an experiment to secure data for constructing 
and comparing pitch scales. The scales he constructed did not agree 
with each other nor with scales based on the findings of Shower and 
Biddulph (39) and of Stevens and Volkmann (42). The difference could 
not be attributed to experimental error. Lewis concludes that any pitch 
scale depends, in some measure, upon the group of persons who made the 
required judgments and upon the particular method of observation 
employed. 

If it is true that pitch depends upon the particular individual’s 
judgment, then each judgment would not necessarily be alike, as Lewis’ 
investigation shows. This result certainly is not evidence that pitch is 
an inborn quality, for if so, then a certain number of vibrations of a tone 
would be heard as exactly the same pitch judgment by every individual. 
Rather, it is evidence that the ability to discriminate pitch is acquired 
in the person’s lifetime, and that accuracy in hearing a pitch asa certain 
note depends upon his own reactional biography. 

Bachem (4), in one of the most recent experiments, studied 103 cases 
from which he devised his three types of absolute pitch discrimination: 
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genuine, quasi, and pseudo. All of these types of absolute pitch discrimi- 
nation occur with variation in acuity, range of the musical scale, and 
influence of timbre. These variations occur because of differences in past 
experience and in familiarity with tones. 

Bachem in an investigation to secure comparative results conducted 
identical experiments on persons with and without absolute pitch. A 
first pair of experiments was performed on the piano by offering a tone 
to a person placed back toward the piano and then having the person 
turn toward the piano and identify the key without striking it. Twenty 
tones were judged this way and another twenty were identified by strik- 
ing the key. In order to compare the results of this with accuracy of 
absolute pitch judgment, several ranges of the piano scale were tuned 
to eight tones, and marked correspondingly. Most subjects identifie 
the tones correctly. 

A second pair of experiments was designed for more accurate results. 
A tone was offered several times on a tone variator which was standard- 
ized in vibrations per second. Every five minutes a different tone was 
sounded and altered according to directions of the subject, until the tone 
was considered the right one. A similar experiment was conducted with 
the five-minute intervals replaced by half-day intervals. 

The average error of absolute pitch determination was 0.6 per cent 
for observations extending over one-half hour. For persons without 
absolute pitch, the error amounted to about 8 per cent on the tone 
variator, representing an interval of one to two half-tones. On the ane 
an error of about five half-tones was found. With these results oa 
says there is conclusive evidence of the great difference in accuracy 
pitch identification in persons without and with absolute pitch. id 

As Bachem explains, attention, or what the interbehaviorist WO" 
call attending to stimuli (16), is an important factor to be considere- 
Attending to stimuli we must consider as having the same sort of chan" 
for development as reacting to tones, given in the explanation 4 e 
An obvious example is the development of hearing of a b 
Bachem (4) tells of a blind person without musical training W 
his absolute pitch accidentally during a conversation with a 
of absolute pitch and he was able immediately to identify tones oo per- 
after he knew how to denote them. Bachem admits that the arae ai 
centage and high degree of absolute pitch in congenitally Blin s case? 
shows that inheritance is not the only factor. Eleven of his 
were blind since birth and in none of these cases were there «fluence 

who had absolute pitch. Bachem also admits that there is a" me 
of forced attention to sound by the blind. : ced by 
Regardless of such evidence, Bachem shows that he is influen 
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traditional views for he thinks that heredity plays some part. “An 
alternate possibility,” he says, “could be that the musical environment 
surrounding these families played an important role” (4, p. 437). He 
rules this out in the case of two families that he studied, because in one 
the grandfather had absolute pitch and the mother was not very musi- 
cal. In another, two musical prodigies studied on a piano which was so 
much out of tune that they had to distinguish between the name of the 
key and the actual tone produced. Considering the first case, there is 
no indication that the child had not associated with the grandfather and 
also according to an interbehavioral explanation, it would not make any 
difference if the child had not associated with anyone who had absolute 
pitch. In regard to the second case, the conditions under which the 
children worked would mean that they would have to attend to ténes 
even more carefully than most students, and this would help, not hinder, 
the acquisition of absolute pitch. 

It is true, of course, that a musical environment may help the de- 
velopment of pitch discrimination because of the added opportunity to 
attend to tones. Absolute pitch occurs at a decidedly higher percentage 
in musical groups, particularly professional musicians. Bachem states, 
“Especially is this true of those with early musical training.” He goes 
on to say, “It seems that attention to musical tones in early youth plays 
a predominant role in the development of absolute pitch’’ (4, p. 439). 

Besides the statements made by these investigators, the results of 
many other experiments show the incorrectness of saying that abso- 
lute pitch is an absolute inborn quality. Meyer and Heyfelder (27) 
attempted to train themselves and their results show a high degree 
of correct judgments. Köhler (18) after two weeks of practice could 
make 112 correct judgments out of 220. Gough (13) trained 90 stu- 
dents and got an improvement. Mull’s data (28) show that practice 
increases the percentage of correct judgments from 40 per cent to 82 
Per cent. Wedell states, “It is evident that some of the later experi- 
Menters are aware that even in absolute pitch, the judgment is at least 
in part one of relation” (46, p. 488). 

Wedell’s experiment led to these conclusions: 


© Relatively unmusical observers can increase their accuracy. 
. Greatest increase is in the first few practice sessions. 
. Limit of ability is an average error of about three semitones. 
à . The course of the learning process is very irregular and there are large 
Individual differences. . , 

5. Unmusical subjects can learn accurately and easily to recoguize tones 
that are eight and one-half semitones, but they fail to judge intervals when de- 


creased to five and one-half semitones or less. 


Pewne 
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Riker (35), in an experiment comparing subjects who said they had 
pitch-judging ability with those who disclaimed it, concluded that 
ability to judge pitch was not confined to specially trained or talented 
subjects. He also concluded that there were various degrees of accuracy 
in judging pitch and that it was “ . . . a function of day to day experi- 
ence with music” (35, p. 346). 

Absolute pitch in children. Very little study has been made on chil- 
dren by early investigators, but Abraham (1) trained three four-year-old 
girls to sing “Ade” to the mode a-d. One had been to kindergarten 
where songs were transposed and she sang it at various pitches but with 
the correct interval; the other two were always correct even after three 
months and one after one and three-quarter years. 

Bennedik (7) had three children who all acquired absolute pitch 
ability but in different measures. Copp (11) taught children to name 
heard tones. From her experience with school children she states that 
80 per cent of all children can acquire absolute pitch. Katz (17) and 
Bennedik (7) substantiate this but not with as high a percentage. 
Komatsu (19) experimented with a child, four years and ten months at 
the beginning of training, who was able, after one year, to discriminate 
46 harmonics consisting of three or four tones. 

A number of experiments as to whether absolute pitch can be ac 
quired by anyone with normal hearing who wants to learn have been 
performed using tuning forks, piano, and oscillators (34). All expert 
ments point to the conclusion that pitch naming ability can be improve 
toa certain extent by training. Since the learning period seldom cover 
more than a few months no greater result could be expected. t 

Probably the most significant and thorough experiment was con 
ducted by Wolner and Pyle (56) to show what effect age and practic? 
have upon pitch discrimination. Music teachers in three elementary 
schools selected pupils showing the greatest deficiency of pitch discrimi 
nation. From this group the seven poorest were selected. No pup! 
selected could distinguish the thirty-vibration differences 0” ne 
On the piano, in general, they could not distinguish differences ° = 
octave, fifth, third, wholetone, or semitone. None of them could sing’ 
although they had been in music classes from the first grade an ich 
at the time of the experiment in the fifth, sixth, and seventh gt 
There were three boys and four girls. ine 

Each pupil received individual instruction and tests for twenty ™ t 
utes each morning, five days a week. The whole number of hours pa d 
in training was on the average 16 for each pupil, extended over 2 P® 
of 81 days. 
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The definition and meaning of pitch, or “high” and “low,” as dis- 
tinguished from intensity, duration and timbre were strongly and re- 
peatedly emphasized, particularly in the early days of training. The 
pupils were led to sce the necessity of thinking of tones as one would 
think of a problem. Interest, attention, and concentration were worked 
for in the method of teaching. 

All seven pupils learned to discriminate perfectly the intervals of 
octaves, fifths, wholetones, and semitones in the range from A, sixteen 
tones below middle C, through A, thirty-four tones above middle Cra 
tonal range of four octaves. There was great variability in the time 
required to reach a given degree of efficiency, and in the response to 
methods and changes of methods. With the forks, four of the pupils 
became perfect in distinguishing all the pitch differences from the larg- 
est, thirty vibrations, down to the smallest difference, one-half a vi- 
bration. Of the other three pupils, one learned to distinguish perfectly 
down to two vibrations difference, another down to three vibrations 
difference, and the third down to five vibrations difference. The stand- 
ard of perfection was 10 out of 10 trials correct. 

Each pupil improved noticeably in ability to sing. At the conclusion 
of the experiment, one pupil sang the words and music of several songs, 
with no trace of pitch deficiency; and also sang major and minor scales, 
chromatics, intervals, and tones picked at random. Another sang scales 
and intervals and the music of a song without words. Two pupils sang 
scales and intervals. The other three pupils sang, not perfectly, but with 
tremendous improvement over their initial efforts. 


CONCLUSIONS 


The statements made by the writers we have quoted and the results 
of the many experiments we have discussed certainly show the need for 
an adequate explanation of absolute pitch. The results are: 


1. Absolute pitch and lesser degrees of pitch discrimination can be acquired 
Y some individuals. Thus, it proves false the idea that absolute pitch is some 
rare power over which we have no control. n J r 

2. Behavior that closely entails pitch discrimination, such as musical experi- 
ence and musical training, allows a much better opportunity to develop keener 
Pitch discrimination. 

3. Pitch discrimination can be acqu 
an indication that the younger the reactiona! 
new reactions. 

4. The development of keen 
velopment of pitch discrimination. 


lind persons and child prodigies. E 
5. The abio ehadal development is brought out by those indi 


p 


ired readily in early age, which may be 
1 biography the easier it is to acquire 


er attending to stimuli makes for a keener de- 
Obvious examples of this are congenital 


vi- 
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duals who have built up in their lifetime keen pitch discrimination and sud- 
denly realize that they are able to perform such behavior. 


We can see that an interbehavioral view is justified by the results of 


past experiments. Basically, the interbehavioral explanation of absolute 
pitch rejects the conception of an inherent faculty or quality. It ac- 
counts for absolute pitch and all pitch discrimination as behavior de- 
veloped within the lifetime of the individual. This means that the 
discrimination is due to the way in which the individual builds UP 
reactions to sounds, and more specifically to the thoroughness in his 
development of attending to tones (stimuli). Accordingly, the indi- 
vidual’s reactional biography, built up from past experience, is the 
important factor in determining what sort of tonal discriminations he 
will make. It follows then that “absolute pitch” is nothing more than 
a fine degree of accuracy of pitch discrimination. The pitch of a tone 18 
the sound that the particular individual learns as that particular tone- 
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SELECTIVE SAMPLING IN PSYCHOLOGICAL RESEARCH 


ELI S. MARKS 
National Office of Vital Statistics 


Through a Jong process of indoctrination, psychologists have been 
made fully aware of the importance of statistical tests of significance in 
the analysis of data. They are, unfortunately, less aware of some of the 
basic requirements underlying these tests. 

Some thought has been given by research workers in psychology to 
the importance of the form of distribution in the population sampled. 
Much less attention has been given to the even more important question 
of the method used in drawing a sample. This article presents two ex- 
amples of difficulties in the interpretation of data arising from methods 
of sample selection fairly common in psychological research (and in 
research in other fields). 

This article is not intended as a criticism of individuals. Residents 
of glass houses should avoid rock-heaving, and the writer's own research 
work has suffered notably from the very defects stressed in the present 
analysis. The writer cannot even promise not to repeat past errors since 
Practical considerations frequently defeat the best of intentions. It is 
important, however, that all of us realize the limitations of our tech- 
niques and not claim for our conclusions a validity not warranted by 
our methods. 

, The theoretical consequences of selective sampling are well estab- 
lished and have been cited frequently. Nevertheless, this theory has 
been ignored fairly consistently in psychological research. All of us have 
been, to a greater or lesser extent, responsible for this situation. The 
examples selected for this article are not unique nor even unusual. The 
two sets of data analysed herein were selected partly from considerations 
of ease in analysis and partly because they are drawn from studies which 
were carefully planned and executed. The writer feels that the results 
of these two studies are generally valid but that it is necessary to point 
out that this validity cannot be conclusively established by the sampling 
techniques used. - 

A fundamental principle of sampling is that random selection must 
be present at some point in the sampling process. Without random 
Selection it is not possible to determine whether the results obtained 
represent basic characteristics of the population or whether they are 
by-products of the sampling process. This fundamental consideration 
has not been sufficiently stressed in psychological methodology, al- 

267 


* 


268 
though many psycholog 


ELI S. MARKS 


ists are conscious of the diffi 


by it ata “semi-intuitive’’ level. 


A large number of psychological studies us 


selected sub-populations as college students, hospital in 
school children, etc. While the use of such sub-populations is perfectly 
these groups have frequently been used to draw 
conclusions not restricted to the sub-population sampled. Such con- 
clusions can be considered valid only when we know the 


proper, studies based on 


the selection of individuals presen 


In the June 1946 Biometrics Bulletin, Berkson (4) 
limitations of fourfold table analysis with respect to data on the re- 
lationship of diabetes and cholecystitis in a hospital p 
assumes a general population in which the two condition 
cholecystitis) are uncorrelated and points out that, if we assign to each 
condition a probability of a patient having the condition coming to the 


hospital for treatment, 


istics may show a correlation even though the two selec 


the hospital population of these 


independently. Such a spurious correlation will exist 
population unless the two selective rates are equal or one 
rates is zero (i.e. the probability of coming to the hospita 
by one of the conditions). Berkson notes: 
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that does not involve the acquisition of data which woul 
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than independent selective factors. College student groups are a se- 
lected population with reference to both intelligence and economic 
status and the selection is undoubtedly not independent for the two fac- 
tors. Assume that a given college admits only U. S. born males in the 
age range 17 to 23 years, and therefore, that the population from which 
the college students are drawn may be defined as all native-born U.S. 
males aged 17 to 23 inclusive. Suppose further that the college admits 
3% of those individuals in the population having I. Q.’s of 120 or more 
and 2% of those individuals coming from families with annual incomes 
of $15,000 or more. If the admission process operated independently 
with respect to the two factors, we should expect that the college would 
admit 4.94%, of the group of individuals having both an I. Q. of 120 or 
more and a family income of $15,000 per year or more (i.e. 3% on the 
basis of I.Q. only, plus 2% of the remaining 97% on the basis of family 
income only). Actually the two factors may reinforce each other so that 
a larger number of individuals with both high I.Q. and high family 
income would apply for and be admitted to the college than would be 
expected from the consideration of the separate probabilities. 

There are presented below selected results from two recently pub- 
lished research studies. The material presented does not represent the 
whole of either study. The present analysis covers only a small part of 
the conclusions of each study. We are concerned only with the particu- 
lar phases of the studies presented herein and not with the validity of 
the studies as a whole. However, there are no facts presented in the 
complete published reports which would require modification of the 
analysis presented herein. 


STUDY I: ALLPORT AND KRAMER, ‘“‘SOME Roots OF PREJUDICE” 


Allport and Kramer (1) studied prejudices of 437 college undergrad- 
uates enrolled in elementary courses in psychology at Dartmouth, 
Harvard and Radcliffe. They note: 

While there is reason to suppose that college students (perhaps especially 
those studying psychology) are disposed to give less prejudiced responses than 
a more unselected population, still there is no ground for believing that their 
life-experiences with which the questionnaire deals, are in any significant way 
atypical. ; 

As will be seen from the analysis presented below, the selection of 
the group studied by Allport and Kramer may very well influence the 
results obtained. 

Allport and Kramer measured prejudice by use of a set of state- 
Ments concerning Negroes, Jews and Catholics. These were scored on 
a 4-point scale in such manner that a high score indicated most prejudice 
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and a low score least prejudice. Table 21 of the report presents the 
following distribution: 


PERCENTAGE DISTRIBUTION OF PREJUDICE SCORES As 
A FUNCTION OF PARENTAL EDUCATION 


Less Prejudiced More Prejudiced 


Both parents college graduates 60.3 39.7 
One parent college graduate 53.0 47.0 
Neither parent college graduate 41.2 58.8 


In the above table “less prejudiced” denotes those subjects with 
scores below the median for the entire group of 437 undergraduates and 
more prejudiced” denotes subjects with scores above the median. 
Allport and Kramer conclude from this table that 
_ +». Students’ prejudice scores bear an appreciable relation to the educa- 
tional level of their parents. This finding is in line with the knowledge we now 
have that higher educational achievement on the part of an individual tends to 
be associated with fewer hostile attitudes. The novelty of the present finding 
consists in the demonstration that the favorable association between educa- 
tional level and freedom from prejudice reaches back into the preceding genera- 
tion: the more educated the parent the more free is the child from prejudice. 


While the above finding may be quite reasonable, it is not the only 
interpretation possible of Allport and Kramer’s results. Table I shows 
(as Part A) three hypothetical populations from which Allport and 
Kramer’s sample might have been drawn. Selection ratios shown in 
Part B of Table I would yield from the hypothetical populations a 
distribution in the sample identical with that presented by Allport and 
Kramer in their Table 21. 

Allport and Kramer's sample could have been drawn (as they assume 
implicitly) from a general population in which children of college gradu- 
ates are less prejudiced than children of parents who are not college grad- 
uates, However, the sample could have come from populations with no 
difference in the degree of prejudice among the three groups (as in 
Hypotheses I and II of Table I) or from a population in which children 
of college graduates were more prejudiced than children of parents who 
are not college graduates (as in Hypothesis III of Table I). 

No one can demonstrate that any one of the three hypotheses pre- 
sented in Table I is correct or that the hypothesis that Allport and 

Tamer’s group represented an unselected sampling of the general popu- 
lation is more or less likely than any of the three hypotheses presented 
in Table I. It js evident that no final conclusions are possible in the ab- 
Sence of information regarding the actual selection rates of the sample. 


272 ELI S. MARKS 


It should be noted that the three hypotheses presented in Table I 
are not unique. In the absence of knowledge about the actual selection 
rates, there are an infinite number of sets of possible populations and 
selection rates which could yield the results obtained by Allport and 


Kramer. 
Srupy II: BERDIE, “RANGE OF INTEREST AND PsyCHOPATHOLOGIES”’ 


Berdie (2) has developed a list of items to be used in measuring range 
of interests. Subjects check on a list of 22 activities, those which they 
“like to do.” The 22 items include such activities as playing checkers, 
dancing, going on dates, hiking, reading, playing baseball, watching 
baseball, etc. Berdie administered this check list (orally to some groups 
and in printed form to others) to recruits at a U.S. Marine Corps recruit 
training station. He found that recruits recommended for inaptitude 
discharges (by the psychiatric unit) checked significantly fewer interests 
than did “normal” recruits. 

In a second study (3), Berdie reports on range of interests for 
group of 228 marine recruits recommended for inaptitude discharges: 
Berdie notes that the “abnormal” recruits contained “a heterogeneous 
collection of diagnoses, including various types of neuroses, constitu- 
tional states and organic conditions.” For each diagnostic groups 
Berdie presents separate mean interest scores. 7 

Questions can, of course, be raised regarding the adequacy of diag- 
noses and the meaning of the diagnostic categories used. This fact p= 
itself, operates as a “selective factor.” Berdie states that the ni se 
of cases in each diagnostic group “can not be interpreted as show!n& Š He 
frequently these diagnoses occur in the Marine Corps oF Nava ee 
ice, as no attempt was made to obtain a representative sample , ë 
is obvious, of course, that even a representative sample of i 
Corps recruits with a given psychiatric diagnosis is not necessar? A 
representative sample of all individuals in the popul 
given condition nor even of all individuals in the Marine Corpsin @ : ave 
psychiatric category (since undetected conditions mus sly © 


existed). 


a 


ed 
Berdie’s group included 13 men classified as “epileptic” who ek 
a mean interest score of 10.23 items checked and 20 men cla howe 
“constitutional psychopathic state-inadequate personality” who ie" t 
a mean interest score of 5.80. These two groups had respectiv? i 
highest and lowest mean interest scores of the 10 diagnostic To dif- 
reported in the study. Berdie states that “the probability of 
ference” lies between the .01 and .02 levels. 

In Table II are presented two hypothetical 


of 
populations and set 
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TABLE II 


: POPULATIONS AND SELECTIVE Factors WHICH COULD 
Propuce RESULTS SHOWN BY BERDIE 


A. HYPOTHESIS I 


| 
vy 
l CPS-Inadequate Personality Epilepsy 
i 


Interest Population Selection Distribution Selection Distribution 

Score Distribution Rates in Sample Rates in Sample 
0 .03479 .002868 .0998 .001185 .0412 
1 03715 002624 0975 001179 .0438 
2 .03930 .002392 -0940 001171 .0460 
3 04122 002172 0895 -001159 .0478 
4 04291 .001960 0841 001145 0491 
L 5 .04438 001760 .0781 .001129 .0501 
6 04563 001568 .0716 001110 .0507 
7 04663 -001392 .0649 -001090 .0508 
8 -04743 -001224 0580 -001069 .0507 
9 .04800 -001064 -0511 .001046 .0502 
10 04834 000920 20445 001023 .0495 
i 11 04844 000784 0380 ` -001000 -0484 
| 12 04834 .000660 0319 000977 0472 
. 13 -04800 000544 .0261 000954 -0458 
| 14 04743 000444 -0210 -000931 -0442 
i 15 -04663 000348 -0162 -000910 -0424 
16 .04563 .000268 .0122 .000890 -0406 
17 04438 -000200 0089 .000872 .0387 
$ 18 .04291 .000140 .0060 .000855 .0367 
19 04122 .000088 .0036 000841 0347 
20 £03930 -000052 .0020 .000829 .0326 
21 03715 000024 -0009 .000821 .0305 
22 .03479 .000008 0003 -000815 0284 
Total 1.00 1.00 1.00 
Mean 11.00 5.80 10.23 
| Variance 40.00 20.65 39.41 
S. D. 6.32 4.54 6.28 

B. HYPOTHESIS II 
0 02 002340 0468 000376 0075 
1 .03 002121 .0636 -000383 0115 
2 104 001912 .0765 .000407 -0163 
3 .05 2001715 .0858 000446 0223 
4 .07 001529 .1070 -000500 .0350 
5 08 001355 1084 000571 0457 
6 “09 2001191 1072 .000656 -0590 
7 10 -001039 +1039 -000758, .0758 
8 10 -000898 ..0898 000875 -0875 
9 ‘09 -000769 .0692 .001007 -0906 
10 “08 000650 .0520 001156 0925 
| 11 ‘06 -000543 .0326 001319 -0791 
12 .05 -000448 .0224 -001499 -0750 
13 -04 -000363 -0145 -001694 .0678 
14 .03 000290 .0087 001905 0572 
15 -02 -000228 .0046 -002131 -0426 
, 16 “02 1000177 .0035 -002373 .0475 
(e 17 ‘01 “000138 0014 -002630 -0263 
j 18 .01 -000110 -0011 -002903 -0290 
| 19 ‘ot “900093 “0009 "003192 10319 
Total 1.00 1.00 1.00 
Mean 7.95 ne na 
ariance 3 11.78 5 

O SD. tet 3.43 4.32 
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selective sampling rates for individuals diagnosed as epileptic and as 
“CPS-inadequate personality.” In both hypotheses it is assumed that 
the general populations for both psychiatric groups have identical inter- 
est score distributions but that a difference in selection rates for the 
sample exists. Such differences might arise from any of the selective 
factors noted above (ie. selection from the general population for 
Marine Corps service, selection from Marine Corps inaptitude dis- 
chargees for inclusion in the sample or faulty diagnosis). It might also 
arise from an association between range of interests and likelihood of 
detection of the condition, e.g. “constitutional psychopaths with inade- 
quate personality” with a small range of interests might be more apt to 
come to psychiatric attention than those with a large interest range 

Whatever the true situation may be, we cannot conclude that differ- 
ential sampling did not occur without specific data to demonstrate this 
fact. We cannot even assume that random selection was just as likely 
in this situation as differential selection. Such an assumption is the 
equivalent of the old fallacy of assuming that two events are equally 
probable in the absence of information regarding the actual probabilities 
(and then proceeding to demonstrate that there is almost certainly 
mammalian life on Mars). 

From Table II it can be seen that the differences in mean score 0b- 
tained by Berdie could have arisen through selective sampling: The 
result is not confined to Berdie’s data. No confidence can be placed 10 
obtained differences (regardless of “measures of significance”’) unless m 
experimenter has controlled carefully the method of sample selecto 
and knows the probabilities of selecting a given element. If we ait 
such knowledge, we can make proper estimates of population paran 
ters and attach to our statistics properly determined 
sampling error. Without such knowledge, generalization from f sig 
to population is extremely risky- Since the application of a test af af 
nificance implies generalization from sample to population, all © on 
most valuable statistical techniques are meaningless without proper ¥ 
trol of the sampling method. 

It should be noted that a result which applies 
applies equally well to coefficients of correlation, anal 
factor analysis, etc., since these techniques g0 back basically t° ` jing 
ences in group means. It should be noted also that selective sa en 
affects the variance within groups as well as the variance tha 

A sae less 
groups. In Table I the variance within the groups sampled 1s 
the population variance in three cases and greater in the fourt ‘mined 
effect of selective sampling on total variance can on 


to sample ” 
f vari 
yses o sifer- 


“es 


SAMPLING IN PSYCHOLOGICAL RESEARCH 275 


from a knowledge of the number in each group and the selective rates 
for each group. Thus, measures of correlation may be affected by se- 
lective sampling by distortion of both variance within groups and vari- 
ance between groups. 


SUMMARY ` 


Through the use of two published psychological research studies, the 
possible effects of selective sampling are emphasized. The present analy- 
sis stresses the importance of control of the sampling process if valid 
generalizations are to be drawn from research data. 
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ADDITIONAL TABLES OF THE PROBABILITY OF “RUNS” 
OF CORRECT RESPONSES IN LEARNING 
AND PROBLEM-SOLVING 


DAVID A. GRANT 
University of Wisconsin 


In a previous paper (2), the writer demonstrated how “runs” of 
“correct” responses and grouping of “correct” and “incorrect” responses 
could be used as criteria of learning or problem-solution in experiments 
involving repeated trials. The present paper has a double purpose: (a) 
to extend the tables of the probabilities of “runs” given in the earlier 
article; and (b) to discuss an important aspect of repeated statistica 
tests. A 
As pointed out in the original paper, a “run” of correct responses 1” 
a properly controlled learning or problem-solving experiment may per- 
mit the experimenter to conclude (at a specified confidence level, e) that 
his subject has learned or solved the problem. A brief table of prob- 
abilities of chance runs was published in the original paper. Probabilities 
were given for various length runs of correct responses in total sequences 
of 16, 20, 25, and 50 trials, where the probability of a chance success 
on a single trial was 0.5. Because of the usefulness of the runs nee 
the limited table of the previous article has been extended in Table a 
the present paper. In Table I, the probabilities of runs are given he 
total sequences of 16, 20, 25, 30, 35, 40, 45, and 50 trials where t% 
probability of a chance success on a single trial is 4, 4, 4, OF 3? ble 

Two obvious trends will be noted in the probability entries of Ta he 
I, and both of these trends have direct implications applicable to a 
effective use of the runs criteria. First, the probability of any given a 
of correct responses decreases markedly as the probability (p Fel 
chance success on a single trial becomes small. Second, with any spe? 

p, the probability of any given run of correct responses increases W 


ith 
an increase in the total number of trials in the series. 
wed how 


1 Child (1) has subsequently pointed out an error in the original paper and pro norma 
Yates’ correction for continuity would, under certain conditions, improve the 
approximation formulae for the binomial. : Glissendorf, 

2 The entries of Tables I and II were computed for the writer by Marie H:i rtment, 
working under the supervision of Professor K. J. Arnold of the Mathematics Pepa S 
Director of the Computing Service of the University of Wisconsin. Each pone ies in 
computed from one or more of the formulae given by Uspensky (3, Ch: > ple I of the 
Table I correct some errors in the fourth decimal place which occurred in 14 


earlier paper (2, p. 278). 
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THE Propasitity Or A Run or AT Least S Successes IN N TRIALS WHEN THE 
PropABILity Or A Success In A SINGLE TRIAL Is p. 


p=1/2 
N 
š 16 20 25 30 35 40 45 50 
5 1965 
6 .0929 1223  .1578  .1918 
7 0429 0582 0770 «0953-1134 .1310 1484. 1653 
8 0195 .0273 .0369 .0464 .0558  .0652 .0744  .0836 
9 0088 0127 .0175 .0224 .0272 0320 0367 ~=—. 0415 
10.0039 0059 0083 .0107 .0132 «0156 .0180 .0204 
11 0017 .0027 .0039 .0051 .0063 .0076 .0088  .0100 
12 0007 .0012 .0018 .0024 .0031 .0037 .0043 «0049 
13 0003 .0005 .0009 .0012 0015 .0018 .0020 .0024 
14 0001 .0002 .0004 .0005 .0007 .0009 .0010 .0012 
15 0000 .0001 .0002 .0003 .0003 0004 .0005 .0006 
p=1/3 
4 1084 1384 .1745 .2090 .2421 .2739 
5 0341 .0448 .0580 .0711 .0839 .0966 .1091 — 1214 
6 .0105 .0141 .0i87 .0232 .0277 .0321 0366 = .0410 
7 0032 .0044 .0059 .0075 .0090 .0105 .0120 .0135 
8 0010 .0014 .0019 .0024 0029 .0034 .0039 .0044 
9 .0003 .0004 .0006 .0008  .0009 0011 .0013 0014 
10 0001 .0001 0002 .0002 .0003 .0004 0004 .0005 
p=1/4 
3 1601 .2002  .2416  =:2922  .3342 
4 0387 .0501 .0641 .0779 .0915 1048 1180 1310 
5 0090 0119 .0156 .0192 .0228 .0264 0299 .0335 
6 (0021 .0028 .0037 .0046 .0055 .0065 .0074  .0083 
7 0005 0007  .0009 0011 .0013 0016 .0018 .0020 
8 0001 0002 .0002 .0003 .0003 .0004 0004 .0005 
p=1/5 
2 4107 
3 0889 1124  .1410 .1687  .1955 2214 2465 ` 2712 
4 0169 .0219 .0282 .0345 .0407 .0468 .0529 0590 
5.0031 0042 0054  .0067 .0080 0092 0105 .0118 
6 0006 .0008 .0010 .0013 .0015 0018 .0021  .0023 
7 0001 0001 .0002 .0002 .0003 .0004 .0004 .0005 
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The first feature mentioned above becomes important when it is de- 
sirable to minimize over-learning in experiments in which the subject 
is likely to show sudden “insightful” learning or problem-solution. In 


TABLE II 


THE LARGEST NUMBER OF TRIALS For Wuicu THE PROBABILITY OF A Run OF AT 
Least S Successes Is Less THAN € WHEN THE PROBABILITY 
Or A Success ON A SINGLE TRIAL Is p. 


p=1/2 p=1/3 
th £ 
(001 .005 .010 .050 -100 hS .o01 .005 .010 .050 -100 
s s 
4--- 7 5 4 — — — 8 14 
= = 6 9 a. § 7a A 
6 es Se o 6 s = 91i o 
y a ee 7 3 21 38 173 350 
¢ = 8 i a 59 a 16 55 105 510 1042 
9 = Ë 17 9 113 9 37 155 304 1521 3117 
10 10 18 28 112 22 10 97 452 898 4551 9340 
11 13 29 50 218 439 
12 18 Si 92 429 837 
13 27 93 175 850 1735 
14 44 176 341 1692 3463 
15 78 341 671 3374 6917 
p=1/4 p=1/5 
£ 2 100 
(001 .005 .010 .050 .100 ‘001 .008 .010 .050 ' 
s s 
3 - - — 5 0 j =a = 2 - 
4 — 4 6 19 38 5 = = E 2 ig 
5 5 10 17 73 146 2 = 6 W 2 gy 
6 10 32 59 284 579 5 y 23 42 %3 ogy 
7 27 115 225 1125 2306 6 24 102 200 1006 10294 
8 94 444 884 4486 9208 7 103 495 987 5014 


l os single 
such cases, a reduction of the probability of a chance success g ning: 
trial? will reduce the length of the run necessary to indicate pehavior 
Short runs might also be of value in detecting “hypothesis 


f 5 yior. 
and other theoretically consistent but evanescent forms of beha w 
. hed 
2 Reduction of the a priori probability of a chance success may be accomplis 
adding to the number of possible but incorrect responses. 


À 
f 
' 
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The second feature mentioned above leads to a difficulty when train- 
ing is to be given until the subject’s performance meets any constant 
criterion. Suppose, for example, an experimenter is running 25 trials a 
day on a simple discrimination problem in which the subject has to pick 
the correct one of two objects. He notes that p=}, and from Table I, 
that a run of eight or more correct responses (P =.0369) would be sig- 
nificant at the 5% level and decides to use this as his criterion of learn- 
ing. Suppose that his subject fails to achieve this criterion the first day. 
The experimenter might then decide to continue training until, on some 
day, the criterion is met. With sufficient persistence on the part of the 
experimenter, the probability of achieving this criterion will approach 
1.00 even though the subject always responds randomly with respect to 
the critical cue, throughout the experiment. Correct use of the runs 
criteria requires that all trials within a previously selected limited series 
be taken into account. This condition may be met in any of a number of 
ways. For example, the experimenter may decide in advance to limit his 
experiment to a given number of trials, say, 500. Or, again, he may 
decide in advance that the 25 trials shall be the one test series. Some 
limit must be set—in other words, failure of the experiment as well as 
its success must be made possible. 

To make convenient the application of the runs criteria when a 
large total number of trails is contemplated, Table II is presented in 
which the maximum series length is given such that a run of at least S 
correct responses is significant at the given confidence level, €. 

In using Table II, the experimenter described above would presum- 
ably decide, in advance, that if the subject has not achieved the criterion 
in a certain number of trials, say, 429, the experiment will be termi- 
nated. If 429 were selected, the significant (e<.05) run would be 12 
correct responses. Here the daily records would be pooled so that arun 
of correct responses might extend over two days or two 25-trial series. 
Again it will be noted, that had the single trial probability of a chance 
ra been smaller than 3, the significant run would have been quite 
Short, 
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ENCYCLOPEDIA OF PSYCHOLOGY 
A SPECIAL REVIEW* 


S. RAINS WALLACE 
Tulane University 


This volume is an attempt to present the field of psychology in 
encyclopedic form. It consists of approximately 100 articles represent- 
ing 82 contributors. Some of the articles discuss such broad subjects a§ 
Child Psychology, Emotion, Personality, or Motivation. Others —for ex- 
ample, Aspects of Blindness, Psychological Psychodrama and National 
Council of Women Psychologists—are confined to relatively narrow am 
specialized topics. In general, emphasis is placed upon contemporary 
problems, approaches, and findings, although an occasional article pre- 
sents a somewhat exhaustive historical account. Some contributors at- 
tempt to give a thorough review of their topics with carefully annotate 
references to the literature. Many writers content themselves with 
general discussions in which their own interests and theories take prece- 
dence over the work of others. From the viewpoint of both subject 
matter and style, therefore, there is an inexcusable lack of uniformity O" 
consistency. 

In a preface, the editor writes that one purpose of this volume was 
“to emphasize some of the trends in contemporary psychology 
which seem to have supplanted much of the traditional material.” THe 
appears to indicate that some general plan was followed in determini" 
the apportionment of space to the various subjects. Unfortunately, SUC Ss 
a plan is not readily discernible. Certainly one hesitates to believe tha 
the allotment of more space to Parapsychology and Pseudopsychology 
than to Learning, Experimental Psychology, Industrial Psychology os 
Aptitudes and Aptitude Testing is indicative of the Editor’s opinions 
about current trends in the field. While some plan may be discovered ap 
the fact that topics which relate to abnormal psychology and psychiatry 
are assigned over 200 pages of the 897 total, one must W if it were 
really planned that Psychosomatics should receive 43 pages it to 
therapy, 16, and Religion and Psychiatry, 11. Similarly it is difficu a 
grasp a plan which leads to the inclusion of separate sections 08 a y 
of Reference in Psychology, Phenomenology, Philosophical Bi, 
Points of View, Scientific Law in Psychology, and Systematic P. sycholog?" 


2 h- 
* HARRIMAN, Priuir L. (Ep.) Encyclopedia of psychology. New York: The PhilosoP 


ical Library, 1946. Pp. vii+897. 
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Equally difficult is the explanation of a 13-page section on Language and 
Psychology, a 9-page section on Language, Psychology of, and a 33-page 
section-on Semantics, Language. When these assignments are compared 
to the 11 pages given to Learning, the 4 pages given to Industrial Psy- 
chology, and the 16 pages given to Personnel, Teacher, the existence of 
any plan at all is placed in doubt. . 

This doubt is not dissipated by a closer examination of the book. 
Instead, as in “Alice in Wonderland,” things get ‘“‘curiouser and curi- 
ouser.”” It is a disturbing discovery that 8 pages are devoted to a dis- 
cussion of the Eye, including some material which is largely duplicated 
in a section on Color, but that only 4 pages are given to an anatomical 
description of the ear, allowing a few paragraphs in the section on Re- 
ceptors and Effectors for the facts of audition. Does this mean that cur- 
rent trends in psychology have forced hearing out of style? Is there sig- 
nificance in the fact that Jung is the only psychologist to appear in a 
section under his own name or that he, Adler, Porteus, and Rorschach 
are the only individuals referred to in the index? 

Elsewhere there is further evidence that the editor failed to plan 
or, indeed, to edit. According to the Preface, the volume was intended 
“to meet the requirements of the serious investigator who wishes 
to acquaint himself with various topics in modern psychology which lie 
outside his field of special interest and competence.” It was also 
supposed to be serviceable “in the enrichment of various topics by ini- 
tiating the student into an intensive exploration of the primary data.” 
How will the serious investigator react when he finds that some articles 
are accompanied by complete bibliographies while others have none? It 
is to be hoped that he will not suppose that bibliographies are omitted 
only for those topics for which publications are lacking. Certainly, it 
would be unfortunate if a student arrived at that conclusion when he 
discovered his intensive exploration of the primary data in the field of 
learning blocked by the absence of a bibliography while one of 37 titles 
beckoned him into the field of Portews Maze Tests. 

Other evidence of inadequate editing is found in the fact that the 
bibliographies are presented in inconsistent formats which appear to 
depend upon the habits and vagaries of the individual contributors. 
This defect is only one of a number of symptoms. For example, the sec- 
tion on Genetic Psychology, Experimental, refers by name and number 
to over 180 sources but the editor somehow failed to append the bib- 
liography. These faults do not increase the reader’s confidence. They 
do increase the sympathy he feels for the contributor who, in her article 
on Color, blithely wrote ‘‘See heredity,” and who now knows that there 
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is no section on heredity. The “index” of this encyclopedia is a 3-pase 
table of contents with an attempt at cross-indexing which is exemplifie 
by the entry, “Ear—see Eye; Receptors and Effectors.” Incidentally, 
the “index” includes no page numbers. 

Some may feel that these criticisms are of a petty variety- They are 
presented, however, to support a grave charge. In brief, it is believe 
that the contents of this volume were amassed by the process of selecting 
certain contributors, and in general, of allowing their own interests, free- 
dom of time, notions of style, etc., to determine what, how, and how 
much they wrote. It is submitted that even if contributors are max- 
imally expert in their particular fields, a useful reference volume cannot 
be produced unless someone assumes responsibility for over-all plr 
ning and presentation. In this case we may question not only the dis- 
charge of this responsibility but also the assertion of the Editor that 
these articles are written “by those who have achieved distinction 
through contributions to special areas within the science of psychol- 
ogy.” Many of the contributors are indeed eminent in the field of which 
they write. But many others are relatively unknown. Whether the 
writers deserve to be unknown is not to be discussed here. We mo 
however, leave it to the reader to reconcile the editor’s statement with 
the fact that the section on Emotion, Frames of Reference in Psychology 
Jung, Personal Document Analysis, and Phenomenology are the work ° 
a single contributor. 

Admittedly, this is not a pleasant review. It is not intended to Re: 
The title of this book is such as to place it almost inevitably in every 
college library and in the hands of many laymen who honestly desire “o 
extend their knowledge of the field. At its best it will give students 2? 
laymen alike a mistaken notion about the nature of psychology a 
science. At its worst, it will make them think they know more than they 
do or will make the more perspicacious of them think psychologists agree 
even less than they do. The thinking student who reads the section O” 
Conditioning and is told that “ . . . Hull has formulated by far the most 
extensive and consistent theory of learning thus far offered .-- magga 
ing to perceive that something is radically wrong when he reads the 
section on Learning and finds that Hull’s work is not mentioned. aaa 
thing is surely wrong; but with the book, not with the field. Should ° 
something be done to protect psychology from this sort of thing? 


oe 


THE RELATION OF EMOTIONAL ADJUSTMENT TO INTEL- 
LECTUAL FUNCTION—A NOTE 


ROBERT E. HARRIS AND CLARE WRIGHT THOMPSON 


University of California Medical School, Department of Psychiatry and 
The Langley Porter Clinic 


It has often been asserted that changes in a child’s emotional adjust- 
ment are reflected in corresponding changes in his tested intelligence. 
Most clinical psychologists have had occasion to report that a measured 
IQ has seemed too low an estimate of a disturbed or anxious child’s 
“real” intelligence. This reservation is usually based on observations of 
anxiety and non-intellectual attitudes which operate to decrease effi- 
ciency in the test situation. The statement implies that if these factors 
were not present the measured IQ would be higher. Since tests of intel- 
lectual ability are usually standardized on non-clinic populations in 
whom these factors are minimal, it is sometimes necessary to make such 
qualifications. 

However, it is commonly asserted that there is a more pervasive kind 
of impairment of intellectual function which operates to lower efficiency, 
not only in the test situation but in other situations demanding intel- 
lectual effort as well. It is further asserted that changes in the life situa- 
tion operating over the course of time, or expedited by psychotherapy 
or other manipulation, so modify these inhibiting factors that subse- 
quent IQ's are significantly different from the initial measure. Demon- 
stration of the existence of such factors requires, of course, measurement 
Over a period of time of changes which are greater than might be ex- 
pected from the inadequacies of the test and the difficulties inherent in 
testing emotionally disturbed children. 

The most recent study purporting to demonstrate such changes is 
that of Despert and Pierce (1). Their purpose was to “ascertain whether 
any relation can be established between emotional adjustment as shown 
in the total record and intellectual function as determined through psy- 
chometric testing” (p. 7). The revised Stanford-Binet was administered 
to 39 children before and after attendance at nursery school. Available 
also was information from which they made evaluations of personality, 
of socio-emotional adjustment and of the family situation. After the 
administration of the final test, the records were examined for factors 
Which might account for observed increases or decreases in IQ. Ac- 
Cording to the authors: “The results of the psychometric tests and the 
findings related to socio-emotional adjustment were analyzed inde- 
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is no section on heredity. The “index” of this encyclopedia is a 3-page 
table of contents with an attempt at cross-indexing which is exemplified 
by the entry, ‘‘Ear—see Eye; Receptors and Effectors.” Incidentally, 
the “index” includes no page numbers. 

Some may feel that these criticisms are of a petty variety. They are 
presented, however, to support a grave charge. In brief, it is believed 
that the contents of this volume were amassed by the process of selecting 
certain contributors, and in general, of allowing their own interests, free- 
dom of time, notions of style, etc., to determine what, how, and how 
much they wrote. It is submitted that even if contributors are max- 
imally expert in their particular fields, a useful reference volume cannot 
be produced unless someone assumes responsibility for over-all plan- 
ning and presentation. In this case we may question not only the dis- 
charge of this responsibility but also the assertion of the Editor that 
these articles are written “by those who have achieved distinction 
through contributions to special areas within the science of psychol- 
ogy.” Many of the contributors are indeed eminent in the field of which 
they write. But many others are relatively unknown. Whether the 
writers deserve to be unknown is not to be discussed here. We may, 
however, leave it to the reader to reconcile the editor’s statement with 
the fact that the section on Emotion, Frames of Reference in Psychology, 
Jung, Personal Document Analysis, and Phenomenology are the work of 
a single contributor. 

Admittedly, this is not a pleasant review. It is not intended to be. 
The title of this book is such as to place it almost inevitably in every 
college library and in the hands of many laymen who honestly desire to 
extend their knowledge of the field. At its best it will give students a” 
laymen alike a mistaken notion about the nature of psychology aS ê 
science. At its worst, it will make them think they know more than they 
do or will make the more perspicacious of them think psychologists agree 
even less than they do. The thinking student who reads the section on 
Conditioning and is told that “ . . . Hull has formulated by far the most 
extensive and consistent theory of learning thus far offered . . . ” is 80” 
ing to perceive that something is radically wrong when he reads the 
section on Learning and finds that Hull’s work is not mentioned. Some- 
thing is surely wrong; but with the book, not with the field. Should not 
something be done to protect psychology from this sort of thing? 
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THE RELATION OF EMOTIONAL ADJUSTMENT TO INTEL- 
LECTUAL FUNCTION—A NOTE 


ROBERT E. HARRIS AND CLARE WRIGHT THOMPSON 


University of California Medical School, Department of Psychiatry and 
The Langley Porter Clinic 


It has often been asserted that changes in a child's emotional adjust- 
ment are reflected in corresponding changes in his tested intelligence. 
Most clinical psychologists have had occasion to report that a measured 
IQ has seemed too low an estimate of a disturbed or anxious child’s 
“real” intelligence. This reservation is usually based on observations of 
anxiety and non-intellectual attitudes which operate to decrease effi- 
ciency in the test situation. The statement implies that if these factors 
Were not present the measured IQ would be higher. Since tests of intel- 
lectual ability are usually standardized on non-clinic populations in 
whom these factors are minimal, it is sometimes necessary to make such 
qualifications. hak 

However, it is commonly asserted that there is a more pervasive kind 
of impairment of intellectual function which operates to lower efficiency, 
Not only in the test situation but in other situations demanding intel- 
lectual effort as well. It is further asserted that changes in the life situa- 
tion Operating over the course of time, or expedited by psychotherapy 
or other manipulation, so modify these inhibiting factors that subse- 
quent IQ's are significantly different from the initial measure. Demon- 
Stration of the existence of such factors requires, of course, measurement 
Over a period of time of changes which are greater than might be ex- 
Pected from the inadequacies of the test and the difficulties inherent in 
testing emotionally disturbed children. i 

The most recent study purporting to demonstrate such changes is 
that of Despert and Pierce (1). Their purpose was to ‘‘ascertain whether 
any relation can be established between emotional adjustment as shown 
1n the total record and intellectual function as determined through psy- 
Chometric testing” (p. 7). The revised Stanford-Binet was administered 
to 39 children before and after attendance at nursery school. Available 
also was information from which they made evaluations of personality, 
of socio-emotional adjustment and of the family situation. After the 
administration of the final test, the records were examined for factors 
Which might account for observed increases or decreases in IQ. Ac- 
Cording to the authors: “The results of the psychometric tests and the 

ndings related to socio-emotional adjustment were analyzed inde- 
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pendently” (italics theirs). However, they do not use the word “‘inde- 
pendently” in its usual sense; apparently they did not attempt to pre- 
dict the measured intellectual changes from personality data alone. 
Rather, the procedure seems to have been to examine the test-retest 
differences, to select those cases showing greatest change and then to 
examine the other data for possible reasons. In every case showing de- 
cline it was possible to find some plausible reason such as parental dis- 
cord, the birth of a sib, a hostile nurse, etc. The factors associated with 
rises were more obscure, the phrase, “improved emotional and social 
adjustment,” appearing several times. The authors do not hesitate to 
ascribe causal effect to these factors. 

However, examination and recalculation of their data indicate that 
the changes can be explained by a simpler hypothesis. It is possible, 
knowing only the initial IQ's and the standardization data of the test, tO 
predict for the group as a whole the mean IQ and its variability on @ 
second test. It is possible also to predict for any given IQ range within 
the group both the direction and the magnitude of change on a secon 
test. These predictions agree almost perfectly with the results presented 
by Despert and Pierce. For example, there are two cases with IQ's in 
the range above 150; one child tested at 155, another at 157. On statisti- 
cal grounds alone (see below), one would predict that on a retest (an 
alternative form administered the same day) initial IQ's of 155 would 
decline to 142, and initial IQ's of 157 to 143. The actual findings are 141 
in the first case and 144 in the second. These two are among the cases 
for whom the authors produce data accumulated over a period of time 
to account for a “significant” change (10 points or more). Also on sta 
tistical grounds, one is not surprised to find that those cases showin 
significant declines averaged 34 points higher initially than a group 
showing comparable rises. 

The initial mean IQ of the ten cases selected by Despert and Pierce 
as showing a significant change downward is 140 and that of the group 
changing upward 106. The uncorrected correlation ratio between initia 
score and loss is .80 (N =39, P <.001). Apparently selecting for direc- 
tion and size of change has isolated groups initially quite different from 
each other. However, on the second test, these groups approach eae 
other very closely; the group showing losses averaged 127 and the grouP 
showing gains, 121. Thus both groups tend to approximate the mean O 
all cases. A conclusion one might draw is that if a child scores hig 
initially, then bad things are going to happen to him, and if he sco” es 


low, good things are going to happen so that he ends up closer to the 
sin term 


mean of the group 1n either case. A more credible explanation i 
of statistical regression. 
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Statistical regression is the name for the phenomenon that a group 
of scores selected because of their deviation from the mean of a popula- 
tion will, on a second test, more closely approximate the population 
mean. A score on a test may be thought of as having two components: 
the “true” amount of the trait measured plus a variable error. If from 
a group of test scores, one selects the highest, the effect is to increase the 
probability of selecting those which have associated with the true score 
the greatest amount of plus error. On a second test, the errors would be 
just as likely to be in the opposite direction and consequently the second 
set of scores would be lower. The converse would be true for scores below 
the mean. This principle is nicely illustrated by the Despert-Pierce 
data. It is possible to predict precisely the amount by which any se- 
lected group of scores will regress toward the mean. Since the amount 
of regression varies inversely with the reliability of the test, it is neces- 
sary to know the reliability coefficient.1 The only other data needed are 
the initial mean and standard deviation for the total group, and, if 
they change significantly, these same values for the second administra- 
tion. 

In the Despert-Pierce data the means from the two administrations 
were within one point of each other. The standard deviations varied by 
the amount and in the direction to be expected for children of this age 
(3). Predictions of regressed mean values were made for various 1Q 
levels from the usual regression equations. If the total group is divided 
at the mean, those scoring above the mean averaged 135 on the first 
test. The predicted mean for the second test was 130, exactly the same 
as the obtained figure. For the cases scoring below the mean the average 
Score was 105 on the original test and 114 on the second test. The pre- 
dicted mean was 112. This is an insignificant difference (P equals 0.80 
for 17 d. f.). Breaking down the groups more finely gives the data 
Shown in Table I. 

Examination of this table makes it clear that those levels farthest ` 
from the mean regressed the most. With this small number of cases, it 
is impressive how closely the predicted values approach the observed. 
Obviously no hypothesis other than regression is necessary to account 
for the changes. To isolate particular individuals from these groups as 


1 The test-retest correlation (rho) calculated from the Despert-Pierce data is 69. 
Because of the time interval between the tests and the (probably) different functions 
Measured at the two age levels, this figure cannot be accepted as a measure of reliability. 
In the calculations to follow we have used the reliability coefficients appropriate to age 
and IQ levels as reported by McNemar (3). When we made the same calculations using 
the value .69, the observed and predicted values were even more similar than those re- 
Ported in the text. 
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showing spectacular changes is to select cases where the errors have 
been in opposite directions on the two administrations, and to assign 
psychological significance to statistical artifact. 

We have labored this criticism so far because of the importance of 
the problem which Despert and Pierce purported to study. That what 
are called socio-emotional factors interfere with (or facilitate) intellec- 
tual achievement there can be no doubt. As an example one might cite 
the nine percent of men in the Stanford Gifted Group who as young 


TABLE I 


COMPARISON Or ACTUAL Retest MEANS AND THOSE 
PREDICTED FROM REGRESSION EQUATIONS 


3 Mean of Mean of second test Dis- 
IQ Range N first test Predicted Observed crepancy 

= E ia : pinan 

67-103 7 93.0 104.4 106.3 +1.9 

104-121 11 113.0 117.3 118.9 41.7 

122-139 15 129.7 127.4 125.8 -1.6 

140-157 6 147.7 137.8 139.2 +14 
T O 


* Class intervals based on size of standard deviation. 


adults failed to rise above the level of semi-skilled labor (4). Similarly 
every one knows examples of superior achievement in individuals of 
mediocre ability. But that the 1Q itself is subject to socio-emotional 
factors is not obvious nor, as far as we know, has it been demonstrated. 
Proof would require an experimental design less contaminated and sta- 


tistically more sophisticated than the study discussed here. Our recal- 
culation of the data offers strong presumptive evidence that the influ- 
ence of such factors is minimal. 

ladjust- 


There remains the possibility that more severe forms of ma 
ment than are represented ina nursery school population may depress 
measured IQ’s and that more intensive therapy may raise them. Pub- 
lished data on this point are equivocal at best (2). Our own data, base 
on 30 clinic children, are too unwieldy for a simple quantitative sum 
mary. Study of individual cases, taking account of age and 1Q leve 
(factors related to the size of the standard error), special handicaps» 
conditions of testing, etc. has led to the tentative generalization oa 
within wide ranges of different kinds of personality and behavior pro?” 
lems,? and within a wide range of “cooperation” in the test situation, ĉ 


z e % . di or 
2 We do not mean to include here psychotic conditions, or organic brain disease 


other physical conditions known to affect cortical function. 
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measured IQ remains fairly stable, even after intensive therapy and im- 
provement in adjustment. The fact that changes in IQ have been so 
small has often been disturbing to therapists working with the patients. 
A number of factors are at work here. First there is the fact that in our 
culture a high IQ is a “good thing.” Second, a therapist has some per- 
sonal stake in the child and his improvement—by identification with 
the child, as a validation of his therapeutic skill, etc. Third, because 
there are no unambiguous indices of improvement in adjustment, the IQ 
is frequently seized upon, erroneously, of course, as such a measure. 
This is an unfortunate state of affairs, particularly because not only 
psychotherapists, but also educators, workers in social agencies, and 
others are motivated to look eagerly for rises in IQ. 

A further implication of the principle of regression for clinical 
should be mentioned. Schools for mental defectives and other agencies 
concerned with children of below average intelligence often have the ex- 
perience of obtaining higher IQ's on re-testing than those reported by a 
referring agency. This is a consequence of the fact that a low IQ is one 
of the factors for which children are referred to such agencies. If the 
principle of regression is not understood, there are at least two possible 
misinterpretations of the higher IQ's on re-test. If the second test is 
given shortly after the first, doubt may be cast on the adequacy of the 
first testing. If the second test is delayed until after a period of training 
or therapy, the conclusion may be drawn that the program has effected 
the rise in IQ. As in the case of the Despert-Pierce data, the effect of re- 
gression is to move the scores closer to the mean of the group from which 
they were selected, in this case, the mean of the general population. 

It is easy to understand the motivation of those who would like to 
raise IQ's. It should be obvious, however, that improved socio-emo- 
tional adjustment is an end in itself, and any program effecting it need 
not depend on changes in measured intelligence for confirmation of its 


success. 


work 
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CORRIGENDA 


In re-writing and re-copying the material for my article entitled 
“The Detection and Treatment of Accident-Prone Drivers,” which ap- 
peared in the Psychological Bulletin in November, 1946 (vol. 43, pp- 489- 
532) I overlooked several small errors which may annoy the reader un- 
less he correct them. They are as follows: 


p. 492, 2d line from bottom of text: 
For —0.21 put —0.021 
p. 492, second footnote: 
For —0.27 put —0.027 
For —0.26 put —0.026 
For 0.21 put 0.021 
For “counter” put “according.” 
p. 493, line 3: 
For 0.37 put 0.037 
For 0.18 put 0.018 
p. 494, last line: 
For 0.84 put 0.79 
p. 500, line 2: 
For 19 put 10 
p. 510, par. 1, line 3è 
For rı: put riz 


H. M. Jonnson 
Tulane University 
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Duntap, Knicut. Religion: its functions in human life. New York: 
McGraw-Hill, 1946. Pp. xi+362. 


This book is “a study of religion from the viewpoint of psychology,” 
an attempt to bring together in systematic fashion the characteristic 
phenomena of religious behavior and on the basis of these data to 
formulate a theory of religion which explains them. The theory thus 
arrived at is stated as follows: 

...in the early stages of religious development religion can be described 
as the attempt to do what man does not know how to do and to know the un- 
knowable, . . . in its present stage . . . religion is the institution or feature of 
culture, which undertakes, in the service of mankind, the functions for which no 
other institution is as yet adequately prepared (p. 321): 


The book is basically a volume in comparative religion. Its plan 
consists in a series of religious topics, under each of which is presented 
typical comparative religious data followed by observations concern- 
ing the psychological significance of these data. These topics include: 
religion as morality, concepts of divinities, the concept of sin, the con- 
cepts of the other world, the evolution of divinities, the role of desire 
in religion, religious symbolism, conversion, the nature of religion, and 
the future of religion in civilization, to mention only a few. 

The author makes generalizations repeatedly in the comparative 
religion portions of the book which can be made only by authorities in 
the field. In the psychological sections of the book, several hundred 
(estimated) generalizations are made, of which his over-all description 
of religion, previously quoted, is the central one. Many of these general- 
izations seem to be justified by the data on which they are based, 
assuming the completeness of those data. For example there are many 
such well substantiated observations as these: “Gods are not essential 
for religion . . . All religions, however, . . - reach a stage in which gods 
and goddesses appear and multiply (p. 98). . . only the concept of a 
Personal god can satisfy man’s feeling toward divinity. .. The sub- 
stitution of an impersonal world power for a personal god is a regression 
toward a more primitive conception”... (p. 336). Many of these con- 
clusions however would seem to be easily replaced by other and often 
contradictory hypotheses to explain the same data. 

There is nothing in the volume that remotely touches on experi- 
mental psychology. The major concepts in the psychology of personal- 
ity are only occasionally mentioned and then not systematically. 
Many psychologists will be astonished at this statement: “We make an 
unjustified assumption even if we ascribe motivation to animals, for 
Motivation involves desire, which is a complex process probably above 
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the mental level even of the ape” (p. 126). This is especially interesting 
because it is followed by a list of primary desires including the desire 
for food and drink, rest, activity, and the erotic desire, which do not 
seem to be absent from most animals. 

Three religious concepts are dealt with in generalizations which 
appear to be less profound than the problems themselves deserve. They 
are the concept of salvation, the problem of evil, and the problem of 
immortality. Two quotations are illustrative. “ ‘Salvation’ may be 
loosely defined as the process, or the condition, of being saved; that is, 
of being rescued from, or protected from, some harm, danger, risk or 
hazard” (p. 82). This is purely negative. In explaining evil, Dunlap 
says: “God, creating man in His image, made him a person and thus 
permitted man to create evil. This may not be a satisfactory solution 
_.. but it is the best any theologian has been able to think out” 
(p. 254). 

Those psychologists who seem to believe that an adequate inter- 
pretation of human personality and social behavior can be made 
without a serious evaluation of religious behavior ought to read this 
volume. However little or much one may agree with the author, he 
could not fail to be convinced of the importance of the field. 

Those whose religious convictions make it difficult for them to 
study it with truly scientific objectivity will do well to examine this 
largely dispassionate discussion of religious ideas. It will show them 
other points of view than their own. 7 

Despite this somewhat critical review, the reviewer finds himself in 
awe of and with considerable admiration for the courage of the author 
in undertaking so gigantic a task. The book will probably not receive 
nearly the breadth of reading it deserves. As a source book in religion 
and social psychology it must be given an important place in every 
library in which students investigate related problems. 

Ernest M. LIGON. 


Union College. 


FRIEDMAN, BERTHA B. Foundations of the measurement of values: The 
methodology of location and quantification. New York: Teachers 
College, Columbia University, 1946. Pp. viii-+227. 


Critical of the majority of present attitude and interest tests be- 
cause “little attention was paid to the theories upon which the tests 
rested, or to the means used to validate the test,” the author attempts 
to erect “a framework upon which value-testing instruments Cou 
built.” 

For this purpose she examines, at time! 
number of questions directly or indirectly related to t 
of values. Among these are the definition, location, an 


s in exhaustive detail, 4 
he measurement 
d quantificatio? 
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of values, and types of scales which have been, or might be used in the 
measurement of values. 

Since the purpose of value-measuring instruments is to aid in the 
prediction of conduct, values are defined in terms of behavior. As in- 
dices of value, it is proposed to use the way in which a person spends 
his time, money, or energy. Data may be obtained through overt be- 
havior or through verbal statements. Each method has its inadequa- 
cies. There is a further problem in the naming and classification of 
values, but here, as elsewhere in measurement, decisions should take 
account of the purpose of the investigator. 

Quantification of values may be secured through measuring the 
amount of time or money expended or the number of instances where X 
is chosen in preference to Y. However, the primary problem of quan- 
tification is the determination of the empirical meaning of scale scores 
and not the measurement of scale-point intervals. Psychological, and 
Not merely logical, continua must be established. 

Following the discussion of what constitutes measurement in the 
field of values—to which the greater portion of the book is devoted—the 
writer describes three experimental investigations of her own designed 
to contribute data on certain questions which she has previously 
raised, 

The book contains a comprehensive analysis of many problems 
basic to the measurement of values. Pitfalls and difficulties of great 
variety are pointed out. In fact, the writer is led at one point to in- 
Quire, “Is the possibility of locating values, then, hopeless? In reply 
she states, “But this inquiry, it must be remembered, did not begin 
with any assumption that values could always be accurately located, 
let alone accurately quantified. Instead, it set as its goal the improve- 
ment of the location and quantification of values, which it recognized 
Was an activity that was being pursued almost wholly without concern 
for methods which would bring increasingly better results.” 

To many readers the chief interest of the book will lie in the specific 
questions it raises in regard to the Thurstone scales and other widely 
used scales for the measurement of attitudes or values. 

While the author has undoubtedly made a contribution to the field 
of value-measurement, the discussion at times seems unnecessarily 
circuitous, and the reader is led to wonder whether the longest way 
round is really the shortest way home. And it must be added in all 
Sadness that the exhaustive consideration of theory does not appear to 
the reviewer to have improved the ‘“‘practice’’ in values measurement 
as represented in the techniques used by the author in her own ex- 
Perimental investigations. 

ELIZABETH DUFFY. 

The Woman's College of the University of North Carolina. 
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Mavs, A. B. The concept of vocational education in the thinking of the 
generat aaa, 1845 to 1945. University of Illinois Bulletin, 1946. 
p. 107. 


This development of general educators’ ideas provides background 
for understanding educational practices. It has wider significance than 
an historical account of slow progress from cultural to balanced-general- 
vocational philosophies of education. One may infer that during the last 
century two opposing forces controlled ideas: the traditional, advoca- 
ting a cultural education for higher conformities to the regional culture; 
the newer, advocating vocational education for higher efficiencies tO 
serve the needs of agriculture, commerce and industry. The genera 
educators’ ideas guarded the cultural features with few exceptions. 

Freeing’”’ ideas came from insights related to widespread surviva 
needs and satisfaction drives of economic interests and local commun 
ties—a few becoming social movements, chiefly with Federal support. 

Maladaptive social implications such as support for hardening 
unequal caste distinctions are seen for the cultural trend. Such insights 
are lacking for the economic pressures and competitive struggles an 
techniques, though they are implicit in symptoms of tensions and dis- 
turbances of thinking. 

The author faces the future with optimism on the basis of the 
balanced solution, though this looks like an unstable affair. Progress 
for the atomic age should develop the abandonment of competitive 
struggles and wider educational diets than balancing political an 
economic interests alone. The author and his selected quotations 
neglect needs related to building a world culture to secure cooperation 
and security; and to implementing the drives of free individuals and 
groups to realize their personal, social, and creative interests and de- 
velop their distinctive talents. 

James L. GRAHAM. 


Lehigh University. 


KincsLey, Howarp L. The nature and conditions of learning. New 
York: Prentice-Hall, 1946. Pp. xvi+579. 


When one attempts to evaluate a new textbook two points of view 
compete for his attention as frames of reference. The first appraises 
the newcomer with reference to earlier texts and the trends of develop- 
ment which they exhibited. The second derives from experience with 
the needs which such a text must meet. Accordingly, on the one hand, 
the reviewer looks for changes in scope and emphasis in comparison 
with previous works in the field. Once these have been identified, he 
evaluates the salient points of the new work with respect to degree of 
approximation to the structure of what the reviewer regards as a de- 
sirable textbook. 

Professor Kingsley’s text, when viewed within these frames of 
reference, impresses the reviewer favorably on both counts. More than 
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any other text with which he is familiar, the book presents psycholog- 
ical material in a manner most likely to assist the student, preparing 
for an educational career, in acquiring the elementary background of 
principle and technique prerequisite to advanced study. According to 
the author, the treatment of the field of educational subjects: 


. . rests on the conviction that learning is the core of educational psychol- 
ogy, that laboratory research provides the most reliable data concerning the 
fundamental nature and. conditions of learning, and that the educational 
psychology of learning must be based solidly on the fruits of the laboratory... . 
An effort has been made to secure a balance of experimental evidence, illustra- 
tion, interpretation, summary, and application. Illustrations are provided from 
everyday life and from the classroom to promote the comprehension of prin- 
ciples and to help bridge the gap between the research laboratory and the class- 
room, 


These quotations from the preface are amply substantiated in the 

body of the text. Part I deals with the development of the subject 
matter and techniques of educational psychology and their role in the 
educative process, as well as treating comprehensively the fundamental 
nature of learning. Part II treats the topics of maturation, varied 
activity, repetition, motivation, conditioning, and relationship and 
organization as six fundamental principles of learning, Part III deals 
with learning as the improvement of performance and with fatigue and 
physical handicaps as factors detrimental to learning. Part IV deals 
with seven forms of learning treated under the heads of the development 
of motor skills, the development of perception, memorizing, the de- 
velopment of comprehension, thinking as problem-solving activity, the 
development of emotional activity, and the development of attitudes 
and ideals. Part V, the concluding section, treats of the retention and 
transfer of learning. 
_ The book as a whole hews closer to the traditional pattern of texts 
in educational psychology than certain of the more recent books in 
this field. In many ways it just misses being an exceptionally well- 
planned text of the traditional sort. Thus it may be expected to be 
accorded a mingled reception. Consideration of the psychological 
aspects of educational development as it relates to the social develop- 
ment of the child and adolescent in our culture tends to be slighted. 
Many instructors in educational psychology, however, will be just as 
well content. Such matters, it will be maintained, should be left to 
texts in genetic and child psychology. But since state education de- 
partment are notoriously loathe to give certification credit for courses 
lacking the terms education or educational in their titles, the prospec- 
tive teacher is all too often denied opportunity of instruction in this 
important aspect of individual development. ` 

Well-balanced chapters on the development of emotional activity 
and of attitudes and ideals do much to overcome this deficiency, but in 
the reviewer’s opinion they are not completely adequate. Nevertheless, 
the book is a scholarly, readable, and current presentation of the sub- 
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ject which should prove to be a desirable text. The serious student will 
welcome the comprehensive bibliographies which accompany each 
chapter. 
' ? James M. Porter, Jr. 
Carnegie Institute of Technology. 


TRAXLER, ARTHUR E., & TOWNSEND, AGATHA. Another five years of 
research in reading. Educational Records Bulletin No. 46. New York: 
Educational Records Bureau, 1946. Pp. 192. 


Of all the research areas represented by the subjects of the school 
curriculum, that of reading has been, and still is, the most intensively 
cultivated. During the five-year period 1940-1945 — and these were 
“war years” — an average of more than a hundred printed reports O 
investigations relating to reading appeared annually. In view of this 
prolific output of research it is important now and then to summarize 
and to take stock. This the present monograph endeavors to do for the 
period just noted. Taken together with Ten Years of Research in Read- 
ing published under the same auspices in 1941, the monograph enables 
one to note trends with respect to problems studied and techniques 
employed. 

Some of the major trends are recorded in a four-page “Introduc- 
tion.” In the following sixty-nine pages the research reviewed 15 
classified and discussed under twenty topics. Pages 75-184 contain 
bibliographical data for 527 ‘‘selected” studies together with annota- 
tions limited to about one hundred words each. The monograph 1s 
concluded by an alphabetical list of the authors of the reading reports 
and by a subject index. 

The contents of the monograph are perhaps best disclosed by the 
list of topics or problems around which research centered in the perio 


1940-1945: 


1. Reading readiness and beginning reading 
2. Reading interests 
3. Reading in connection with the school subjects 
4. The vocabulary and content of elementary-school readers 
5. Vocabulary lists and vocabulary building 
6. Phonics 
7. Reading tests and testing procedures 
8. Speed of reading 
9, Eye-movements and reading ability 
10. Visual, auditory, and speech defects 
11. Dominance, handedness, eyedness and reversals 
12. Relationship between reading and other factors 
13. Activity programs and reading achievement 
14. Hygiene of reading 
15. Diagnosis of reading difficulty 
16. Remedial and corrective teaching of reading 
17. Developmental reading 
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18. Various aspects of reading (e.g., adult reading) 
19. Reading bibliographies and summaries ; 
20. Books and monographs on reading and the teaching of reading. 


_ During the past fifteen years the authors note increased research 
interest in the problems numbered above as (1), (2), (3), (4), (5), (6), 
(8), (14), and (17), with some loss of interest in problems (11) and (15) 
especially. 

The authors of the monograph are entirely competent as students 
of reading, and their comments and criticisms are insightful and sug- 
gestive. Consequently, their summary is obviously of signal importance 
to educators, whether they be concerned with the practical business of 
teaching children to read in the classroom or with the production of 
original research. But the monograph should be of value to psycholo- 
gists as well, for many of the problems discussed are fully as significant 
Psychologically as they are educationally. * 

WILLIAM A. BROWNELL. 


Duke University. 


Moncrierr, R. W. The chemical senses. New York: John Wiley, 1946. 

Pp. vii+424. 

First published in Great Britain in 1944, this book, written by an 
associate of the Royal Institute of Chemists, covers a wide range of 
subject matter on taste and smell, taken from the fields of biology, 
chemistry, electrophysiology, physics, and psychology. Although the 
author has done an admirable job of coordinating the technical lan- 
guages of these sciences into a very readable book, one might well ask 
just what audience or audiences the book was written for. There is no 
Preface disclosing the author's intent. 

_ Of the fourteen chapters the first is devoted to the place of the chem- ` 
ical senses in the animal and human world. Here the author gives a 
Teview of the importance of the chemical senses in organisms, ranging 
from a description of the avoidance of poisons by lower animals to the 
Use of odors in medical diagnosis. The material of four chapters dealing 
with taste and smell receptors, the nervous system, olfaction, and 
gustation is very similiar to that found in a general psychology text. 
Chapter 6, which is entitled “The Common Chemical Sense” might 
well confuse the reader who is unsophisticated in the traditional pro- 
lems of sense classification. In describing this “separate sense” (p. 
130) the author states: “It is clear that the chemical sense is distinct 
rom smell, taste, touch, and pain, and has a separate set of receptors. 
ese receptors are the free nerve endings of spinal and cranial nerves, 
and are com parable in distinctiveness with heat, cold, or pain receptors, 
ut are more localized.” There are various ways of classifying the 
Senses and one of them is based on the stimuli adequate for them. How- 
ever, this mode of classification has never had the appeal of others. We 
Could speak of the “electrical sense” or the ‘mechanical sense” or even 
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of the ‘sense of noise,” but we do not and for good reasons. It would 
seem better to regard the pain aroused by chemicals as belonging to the 
same “sense” as that elicited by mechanical, thermal, and electrical 
stimuli. This chapter describes the common chemical irritants which 
arouse various sensations, from lachrymators to vesicants. One chapter 
is devoted to chemical sensibility in lower animals. 

The author devotes a chapter to the classification of odors, pointing 
out the values and limitations of such systems as have been propose 
by Rimmel, Zwaardemaker, and Henning. Three chapters on chemical 
constitution and the physical properties of odorous and taste materials 
provide an excellent account of the descriptions of chemicals as relate 
to the various sensory experiences they can arouse. In summarizing 
these relationships into one hundred and twenty “general principles” 
(e.g., the main factor in determining odour is the architectural type ° 
the:molecule) the author states that “they are open to criticism, but the 
subject is in an early stage of development, and if they are criticized and 
replaced by a better lot they will have served their purpose.” Chapter 
12 includes a critical analysis of the theories of odor, with emphasis upon 
the constitution of odorous particles and the way in which they come 
into contact with the olfactory apparatus. The salient features of the 
theories of twenty-three authors are contained in a summary table for 
purposes of comparison. The author states that “. . . the transmission 
of the stimulus from the receptor to the brain is established fact” am 
adds that the nature of the brain processes is still a mystery. Little 15 
said about theories on the perceptual aspects of odor. 

Perfumes and essences, flavor and food, are discussed in the re- 
maining two chapters. Methods are described for separating essential 
oils for use in perfumes and essences. The chief components of some 
seventy oils are given in tabular form, showing their various odorous 
qualities. A detailed chemical analysis of some of the more common 
oils is given, together with a description of their commercial use. Flavor 
is regarded as a complex sensation, comprising taste, odor, and touch. 
Throughout, the author makes effective use of homely illustrations 
which add “flavor” to the text, such as adding glucose to wines, serving 
brandy from thin convex glasses, and masking castor oil with pepper 
mint. 

For a book which covers such a wide range of subject matter, writ- 
ten primarily around the chemistry of taste and smell as its core, The 
Chemical Senses is well integrated. It might well be placed on a reading 
list for some advanced courses in psychology. The book is also of value 
for psychologists as a reference source showing the relationships be- 
tween chemical stimuli and chemoreception. References to origina 
papers are given in the text, but no summary bibliography is included. 
The book contains a few charts and tables, numerous chemical formu- 
lae, a glossary of technical terms, and a good index. 

B. von HALLER GILMER. 


University of Virginia. 
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SPONTANEOUS ACTIVITY OF ANIMALS 
A REVIEW OF THE LITERATURE SINCE 1929 


J. DAVID REED 
The Johns Hopkins University 


Since the beginning of the century there has been a volume of work 
Studying spontaneous activity of animals, especially the rat and mog 
key. The best review article is that of Shirley (97) in the Psychological. 
Bulletin in 1929, Since that time the subject has not been reviewed NN 
comprehensively. Somewhat limited reviews appear in Munn (67), `; 
Morgan (66), and Gray (31). Other and even more circumscribed re- DNN 
views include Richter (73, 74) reporting work done under his direction; N 

loskins (40), describing some of the relationships between endocrines 

and activity; and Kreezer’s (52) summary of methods for measuring 
activity in the rat. A review of diurnal rhythms by Welsh (119) in 
1938 discusses much material not directly related to activity. Mettler 
(64) has reviewed and summarized studies on the effects of striatal in- 
Jury in 1942. 

This article does not attempt to cover work done prior to Shirley’s 
Teview in THIS JOURNAL. Several of the most important references to 
Work done prior to 1929 are included, but the emphasis has been almost 
entirely on later material. 


9 Juell 997 Nee. 
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Tur CONCEPT AND MEASUREMENT OF SPONTANEOUS ACTIVITY 


There are two points the reader should keep in mind as he proceeds 
through the paper. The first is a methodological issue. Much of the 
Tesearch to be reviewed depends on a general concept of spontaneous 
activity without regard to how the activity is measured. It will become 
Evident in the course of the review that our concept of activity must be 
weg to the measure of it which we have used, for the results one gets 
With one measure of activity may be entirely reversed when a different 
measure is used. The second closely related point is a matter of termi- 
ay Since the largest amount of work has used animals running 
"side a drum, it will simplify things if the term activity, without any 
Walification, always refers to running activity, not to other measures 
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of activity. Measures other than those in an activity-drum will always 
be clearly distinguished. 


Method and Apparatus 


Running Drums. Animals and human beings indulge in spontaneous 
activity. This observation has been quantified in many ways- The 
animal most frequently used in experimentation is the rat, whose activ- 
ity is usually measured in what has been called an activity cage, but 
will henceforth be referred to as a drum or running drum. This device 
was first used by Stewart (110) and has been most adequately describe 
by Slonaker (101, 102). It usually consists of two 10-13 inch circular 
boards mounted on a shaft and separated by a sheet of mesh woun 

ound their periphery (86, 94). The rat runs inside the freely rotating 
ail and a counter is attached to record the number of revolutions: 
Unfortunately, the usual system of measurement has shown only total 
activity, not activity as a function of time. 

Recognizing this inadequacy, Skinner (100) used a Harvard work 
tive recor 
whose slope is a constant measure of activity. k 

The drum has almost as many variations as there have been experi- 
menters in activity. Stewart’s 20-inch diameter drum and the 26-1n¢ 
diameter drum used by Park and Woods (71) represent one extreme, 
while Shirley (94) used a 10-inch diameter. Results reported in terms 
of number of revolutions are obviously not comparable when the diam- 
eter of the drums is not the same. Furthermore, equating the running 
by expressing it in distance traversed is of questionable validity in view 
of Farris’ statement that rats in larger wheels run farther than those 10 
smaller wheels (24). 

Depending upon the experiment, the rat may live entirel 
drum (94, 95, 101), have a separate living cage, or use SUPP 
diffuse activity cages (71). Since Richter (73) has shown that the nu 
ber of revolutions of the drum is reduced when the rat has a choice © 
several things to do, the results of different experimenters may not be 
comparable. 3 

The revolving drum has been the most extensively used laboratory 
instrument in investigating activity. Its physical variables have been 
discussed by Skinner (100) and Lacey (53). The reliability of the meas- 
ures obtained is remarkable—Shirley (94) reports a rank-order correla- 
tions of .97 for five-day totals of activity, and a split-half r of .90- 
Beach's (4) figures are even higher (.98). Unfortunately a basic assump- 
tion in these results involves the equivalence of the measures. Lacey 
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(53) raises the very justifiable criticism that the measure may be show- 
ing only the consistency of the different drums. It is significant to note 
that in one case in which the animals were changed from one cage to 
another, the correlation reported was .80 (113). There are wide indi- 
vidual differences even between litter mates in normal rats with respect 
to running, some rats running 200 revolutions per day and others 
20,000. The pattern of running is set up by the tenth day or not at all. 
After this time the individual differences are relatively constant. 

_ The running drum has been used to indicate tension or motivation 
in the rat. Thus, Durrant (20) and Slonaker (106) have correlated run- 
ning with sex drives. Geier and Tolman (28, 29) have used running 

ehavior to indicate increase in tension in the rat. 

Dorcus (16) devised a cage which moved slowly toward a goal object 
When the rat ran inside of it. 

_ Tambour- or Spring-Mounted Cages. Another apparatus for measur- 
Ing activity is that first used by Syzmanski (114, 115) which consisted 
of a spring-mounted cage attached to a lever recording system. The 
disadvantage of lack of damping has been somewhat overcome by 
tambour-mounted activity cages (73, 109). The three supporting tam- 

Ours are joined to one tube and record every movement on a kymo- 
8taph. Both these methods produce records according to time, but 
Tecords which are difficult to treat quantitatively because no ready 
Means of determining the total activity is available. 

Hunt and Schlosberg (42, 43) counted the number of 5-minute active 
Periods occurring in such a cage over varying intervals of time, and 
Irwin (44) recorded the number of active seconds per minute in new- 
born children, Wilbur (121) used a spring mounted cage connected to 
a Harvard work adder to obtain a summative record (of the activity 
Of chicks) which is much easier to interpret. 

Smith (109), measuring audiogenic and electrogenic convulsive ac- 
tivity, supported a cage from four pneumographs or by one large flexible 
hydron bellows. Oscillation could be reduced by means of a small vent. 

Other animals have been used with appropriately modified cages to 
record activity. Monkeys have been fastened by a nine-inch chain to a 
2.5-inch rod, so that movement caused the rod to advance a counter 
(84). A monkey-sized pneumatically-mounted activity cage has been 
used by Kennard, Massimy and Chevallier (51, 62). 

Other Automatic Methods of Recording Activity. Another measure of 
activity has been suggested, incorporating a tilting box (7) in which the 
Movement of the rat from one end of the box to the other advanced a 
Counter, Claiming that the tilting motion would interfere with accurate 
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measurement of rat’s activity, Siegel (98) utilized the animal’s motion 
from one end of a 22X6 inch box to the other to break a photoelectric 
relay and thus advance a counter. 

A horizontal turntable for exercising rats has also been used to record 
activity (21, 22). Since the distance the rat runs depends upon his 
proximity to the center of the turntable, it is probable that this device 
will not be popular in controlled experiments. 

Curtis (15), working under Liddell, reports the use of a pedometer 
to record activity of the sheep and the pig. Head-shaking in chickens has 
also been reported by Levy (58). 

Observational Method of Quantifying Activity. An observational 
means of recording activity has been used by Hall (32), Beach (3), an 
Fredericson (25). Hall recorded the distance traversed by rats in a 
round open field eight feet in diameter. Beach noted which of 36 squares 
a rat entered upon in the ten minutes it was free in an area three feet 
square. Fredericson observed six classes of behavior indulged in by rats 
in a field two feet square. 


Spontaneous Activity as a Behavior Category 


Now let us take a moment to see what the methods just pelea 
o 


have to do with the concept of activity. Most of the authors ten 
lump all manifestations of activity together and to pin one label on al 
of them—activity. This failure to distinguish types of activity in terms 
of its measure leads to a false concept of activity, for what data we have 
point to more than one type, or at least more than one aspect of activity: 

There are, for example, wide individual differences in the running 
activity of rats but not nearly such wide differences in restless cage 
activity. Tainter (116) found that caffeine, metrazol, and picrotox1) ha 
no effect on running but did increase behavior measured in a diffuse 
activity cage. Hunt and Schlosberg (43) found only 9% decrease 1% 
diffuse activity with castration instead of the 98% found by Hoskins 
(40) for running activity. . 

In light of these considerable differences, it seems logical that differ- 
ent terms should be used to distinguish the two devices and the be- 
haviors which they measure. Throughout this paper, the author has 
attempted to distinguish between running activity in the rotating drum, 

- on the one hand, and diffuse activity in a cage or stabilimeter on the 
other. As long as the terminology for these two distinct situations 1s the 
same, the notion will tend to persist that they are strictly comparable 
measures, which they are not. 
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Genetic Basis. Rundquist (89) by selective breeding has been able 
to get active and inactive strains of rats. The active strain is less easy 
to purify than the inactive strain. Selection for running produced strains 
in which there were measurable: (1) increases in number of successful 
matings, (2) increases in sizes of litters, and (3) decreases in the gesta- 
tion period. The active rats also had a higher basal metabolic rate than 
the inactive strain. The selective breeding of these strains has been 
carried through 29 generations, with no change beyond the 12th (9, 
30). Brody has concluded that the two strains differ with respect to a 
Single gene which acts as a dominant in males and a recessive in fe- 
males. This gene must act as an inhibitor, since none of the matings 
Within the inactive strain produces active offspring, but on the other 
hand, active-strain matings produce individuals which vary from ex- 
treme inactivity to extreme activity. The genetic factors are somewhat 
obscured by environmental influences. 

Age. Running activity of rats increases with age until the animals 
are about 80 days old, then is relatively constant until about 120 days, 
after which it gradually falls off till death (72, 95, 101). Richter (73) 
determined the amount of running, diffuse activity, and nest building as 
a function of age. The more active rats tend to have shorter lives than 


the inactive rats. 


THE INTERNAL ENVIRONMENT 
Nutrition 


The daily running activity of the rat increases just prior to the nor- 
mal time of feeding, even though the rat has been fed in what would 
Otherwise be an inactive period (72, 103). This fact is probably expli- 
Cable on the basis of hunger or metabolic changes connected with the 
daily 24-hour hunger rhythm initiated by constant feeding at a specific 
hour (120). Studies made by Richter (73) on generalized activity, how- 
Ever, show that the rat very probably has a two-hour hunger rhythm if 

€ is allowed to have food constantly available. 
r If a rat is deprived of food for a period of time, its activity will tend 
to increase for as much as 96 hours. If deprived of food and water, it 
creases for 72 hours before it drops off (73, 117), probably due to 
teakness, Brobeck (8) showed that if food intake and environmental 
€mperature are held constant, there is a negative correlation between 
activity and weight gain. Smith and Conger (107) varied the diet of 
Tats by keeping the caloric value constant but changing the proportion 
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of fat or protein. They found that up to 56% of the caloric value of food 
may come from fat without reduction in spontaneous activity. Fifty 
per cent of animal protein, however, induces marked reduction 1? 
running. Following ingestion of protein there is a marked metabolic 
rise which is not present following ingestion of fats. This result extends 
Slonaker’s (105) finding that a diet of 14 to 18% protein yields maximal 
spontaneous running. Hitchcock (39), however, fed rats eight grams of 
meat daily and observed little effect on running. $ 

A protein-free diet for short periods of time is effective in increasing 
activity. But a low-protein diet over a long period of time depresse 
activity (39). $: 

When allowed to select the amount of alcohol ingested, rats main- 
tained their activity, although the forcing of alcohol by inclusion in the 
drinking water supply caused a reduction in activity (35). 


Vitamins 


Rats deprived of food, water, food and water, thiamin, and a 
flavin showed increased activity until they were given ample aroun 3 
of the formerly scarce substance; then they were quiescent to a greate 


degree than normal until they had again ‘caught up’ (117). Limita 
n 


of Vitamin B is promptly followed by increased running activity Í 
rat; after five to ten days, however, activity is sharply decreased, long 
before any clinical signs of deficiency appear. This is accompanie pya 
drop in quantity of food ingested. When these rats are then given un- 
limited amounts of food and vitamins, activity decreases and remains 
low throughout the period of increased intake (5). These two papers He 
not completely corroborate Jackway’s earlier finding (47) that sligh 
deprivation of Vitamin B complex, not severe enough to cause any 
symptoms except retardation of growth and slight roughness of coats 
results in a decrease in voluntary activity. , t 
In a very suggestive paper Ziegler and Knudsen (124) showed tha 
young white rats fed on a rachitic diet during infancy ran less after 
recovery from the rickets than normals. If, however, the rats We re 
deprived of Vitamin D at an even earlier age by feeding the rachitic 
diet to their mothers during pregnancy, the surviving pups were more 
active after recovery than normals. Pt 
That deficiency is not always accompanied by increased activity 
is demonstrated by the fact that rats deprived of the magnesium ion 
show a consistent drop in activity without the initial rise usually ac 
companying deprivation (117). Smith and Smith (108) observed that 
rats fed a diet low in inorganic constituents gradually declined in VO" 
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untary running. The reduction in running occurred long before the 
appearance of impaired running ability. 


Drugs 


There has been interest in the effects of drugs on behavior as showing 
perhaps quantifiable action similar to that qualitatively produced in 
human beings. In general, analeptics stimulate activity (6), but have a 
depressant or diminishing effect with prolonged use. Thus cocaine, 
benzedrine, ephedrine, and propadrine increased running for single 
doses, but effects of tolerance showed when the specific drug was ad- 
ministered for several days (116, 123). Caffeine, metrazol, and picro- 
toxin depressed running activity, but enhanced restless activity as 
measured in diffuse activity cages (116). In a subsequent paper, Shulte 
and Tainter (91) have shown the differential temporal effects of the 
administration of caffeine, coramine, and metrazol on diffuse activity. 

Running activity is increased by benzedrine sulphate and by Kola, 
but there is a decrement in activity just before feeding time 24 hours 
later (35, 92). 

Phenobarbital administered intraperitoneally (69) to white rats 
depressed the nighttime running level, but had no effect on the daytime 
level which is normally much lower than that of the nighttime. These 
injected rats ate less, drank less, and gained less weight. Even after 
withdrawal of phenobarbital, the experimental rats were less active 
than the controls (49). 

_ Other drugs used have been dinitrophenol and thiouracil. These are 
discussed below. 


Endocrines 


Adrenals, In a review of experiments dealing with endocrines up to 
1927, Hoskins (40) reports that entire removal of the adrenals leads to 
Marked reduction in running. This operation is very difficult to perform 
Successfully, but in all cases in which there was little reduction in activ- 
iy subsequent autopsy showed hypertrophied adrenal particles. In 
12 out of 27 adrenalectomized rats, Richter (79) was able to restore 
running partially by implants of adrenal gland to the ovaries. Subse- 
quent removal of the grafts decreased activity again. Histological ex- 
amination showed viable cortical tissue but no medulla. Increase of 
Salt in the diet produced a definite but partial increase in activity. 

Tgographic studies show that while the absolute strength of muscles 
remains unaltered, the capacity for work following adrenalectomy is 
reduced to 8% of normal. Bilateral abdominal sympathectomy and 
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adrenal inactivation by section of the nervous connection reduced run- 


ning activity slightly for a period of 10 days with no apparent effect 
after that time (1). Lacey (54) injected adrenalin into rats and found a 
decrease in total diffuse activity although restlessness as indicated by 
behavior in the normally inactive period increased markedly. 

Thyroid. Hoskins (40) has reported no reduction in running resulting 
from thyroidectomy. This result has been questioned by Hall and Lind- 
say (34) who found 50% reduction using the same measure as that re- 
ported by Hoskins. Richter (75) has pointed out that the results ob- 
tained depend upon the amount of thyroid tissue remaining, for a very 
small piece is enough to sustain activity. Thyroidectomized rats may 
be re-activated by replacement therapy. However, if the quantity of 
thyroid administered is too great, the activity is once more reduced. 
That reduction in metabolic rate as such is not responsible for decrease 
in activity was shown by increasing the metabolism of thyroidectomize 
and normal rats by dinitrophenol (33). Inhibition of the thyroid by 
thiouracil resulted in slower growth, lowered metabolism, increase ae 
the length of interoestral periods, reduction of spontaneous activity a" 
disruption of rhythmic patterns of activity (61). Rundquist’s breed © 
active rats had a higher metabolic rate than his inactive strain (90). 

Pituitary. Ablation of the pituitary leads to marked decrease 1” 
running activity. Injection of emulsion of fresh anterior lobe of the 
hypophysis reduced the decrease in activity, but only in one animal out 
of seven did this replacement therapy cause activity to remain at the 
original level. Viable hypophyseal transplants to the anterior chamber 
of the eye had only a slight stimulating effect (81). 

When the stalk of the pituitary is severed in the female rat, the 


normal oestral periodicity is greatly slowed down. Five female om 
showed oestral periods separated by the following numbers of days: 
bably 4 


9.6, 11.2, 13.0, 14.3, 18.4. Richter explains this as being pro 
lation cycle. 


whole multiple of the fundamental four-to-five-day ovu i 
my elimi- 


This explanation is corroborated by the fact that ovariecto 
nated all cycles (77). 

The genetically hypopituitary animal, the dwarf rat, p 
urnal rhythm (70). 

Liver. Normal function of the liver is impor 


ossesses & di- 


tant in maintaining 
ion in 


activity. Ligation of the bile duct caused almost complete reduct i 
animals in which the duct remained occluded. Associated with ie 
tion 0 


effect was widespread destruction of liver cells, considerable dila 
the bile duct, and other profound peritoneal effects (80). Partial hepa- 
tectomy was performed by Dugal and Ross (17) who found that the 


SPONTANEOUS ACTIVITY OF ANIMALS 401 


25% of the liver remaining took the same length of time to regenerate 
as did activity to return to normal. 

Pancreas. The decrease in activity following pancreatectomy is vari- 
able. Some rats show little loss, others great. This reduction is attrib- 
uted to the inability to metabolize carbohydrates (85). 

è Ovaries. There are several measurable sex differences in daily run- 
ning activity. The female goes through a cycle of activity of about 
four to five days’ duration, the peak of which is reached at oestrus 
(23, 118). Hourly records of the female show variation in the distribu- 
tion of the activity as a function of the time in the oestrus cycle (103). 
This cycle may be abolished by bilateral ovariectomy (77, 87), when the 
activity in the drum is decreased by as much as 98%, but it is restored 

Y ovarian transplants (87). The presence of the uterus is not essential 
to this activity cycle, as is shown by the fact that hysterectomy has no 
Permanent effect on running activity (18, 19). If copulation takes place 
during oestrus, the activity is reduced until pregnancy and lactation are 
Over and the female is again in oestrus as shown by cornified epithelial 
cells or receptivity. Stimulation of the cervix during receptivity leads 
to Pseudo-pregnancy with about a 10-day marked reduction in running, 
followed by gradually increasing activity until the previous level is 
reached in about six days (104). k 

The cycles of the ovariectomized rat may be restored in females or 
Impressed in castrated males by several means, all of which involve 
Supplanting the hormonal deficiency. Pregnancy urine of women seven 
to nine months pregnant, given in drinking water, was found to restore 
activity to near normal levels (78). Amniotin injected in spayed fe- 
Males and castrated males had a similar effect (82). The earlier dis- 
‘Tepancy between results of other investigators, reported by Shirley, is 
Possibly explicable by the findings of Young and Fish (122). They were 
able to restore activity to levels prevailing before gonadectomy by es- 
trone administered in such a way that a constant source of the hormone 
Was maintained. A certain small amount of estrone is necessary for the 
Manifestation of running activity, but beyond that quantity, further 
&strone does not increase the activity nor alter its periodicity. These are 
Similar to Heller’s results (36) on male hormone. Castrated males show 
estrus cyclic activity from ovarian transplants. Elderly males increase 
Activity and lose weight following ingestion of an estrogenic fraction of 

Srionic gonadotropic extract (41). 

Testes. The running activity of the male is generally much less than 
that of the female, and lacks its cyclical characteristics. Complete cas- 
tration leads to marked reduction (about 98%) in running. Fractional 
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castration by Gans (27) of the following amounts of the testes of rats: 0, 
1, 14, 1%, 2, led to the following decrements in running compared with 
his controls: 20%, 11%, 26%, 48%, 79%. Probably there was not time 
for regeneration to take place. Castration before the 12th day after 
birth leads to no activity reduction in the adult rat (26) but this 
statement has been questioned by Richter (76) who found the reduction 
in running almost as great as in rats castrated after sexual maturity: 
Richter ascribes this difference to the relative inactivity of Gans’ 
controls. It is possible, however, that the findings of Hunt and Schlos- 
berg (42, 43) would bear on the question. They used general diffuse ac- 
tivity cages and found little difference between the number of five- 
minute active periods of normal and castrated males. The running. 
activity of both male and female castrated rats is increased by testic- 
ular grafts, provided these grafts are “functional takes” (87). 
The relation between sexual drive and running activity is unclear. 
‘A male rat in a running drum near one female rat will show peaks of 
activity correlated with her oestral periods. If several females are Ae 
nearby drums, the total running of the male is increased but is no longet 
of the simple oestral periodicity (20, 106). On the other hand, the cor- 
relation between sex drive (as measured by copulatory tests and the 
Columbia obstruction box) and running activity is near zero (113)- 
Rundquist (89) found that rats bred for activity were more fertile than 
those bred for inactivity. These facts are not contradictory but do 
require clarification. 


EXTERNAL ENVIRONMENT 
Light and Darkness 


The rat displays more running and more diffuse activity in darkness 
than in light. This rhythm is of a 24-hour periodicity and is remarkably 
stable. The daily activity periods are determined by the internal 4 
rhythm of the animal, which rhythm is only gradually changed by He 
external conditions—light and temperature (10, 13, 14, 43, 48). Using 
a rhythm involving alternation of eight hours of dark and eight hours 
of light, Hemmingsen and Krarup were unable to abolish the 24-hour 
rhythm even in rats kept under these conditions from birth (37). By 
using six hours of light and warmth, alternating with six hours of coo er, 
darkness, Browman (14) was able to impress a 12-hour rhythm on about 
30 out of 32 rats. The rhythm was not stable, however, as shown by 
the fact that the longest period during which it persisted in any one rat 
was 37 days, in spite of the continual 12-hour cycle of light and heat. ? 

The female rat shows a four-to-five-day oestrous cycle of activity 
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which can be somewhat altered by lighting in that constant light leads 
to longer intervals between peaks of oestrous activity and lowered daily 
activity (37). Browman (10) even found evidence of constant cornifi- 
cation (but not receptivity) in some of his animals. Subsequently he 
demonstrated that the intact visual apparatus was necessary for this 
Shift to take place (11, 12). Females who were blinded showed the same 
24-hour rhythm as those reared in constant darkness. Feeding schedules 
@pparently had little effect on the running time. That the intensity of 
the constant light is important has been nicely demonstrated by John- 
Son (48) who found that the amount of change in the period of maximal 
activity of mice was related to the quantity of cage illumination. These 
Tesults were obtained with mice in diffuse activity cages, since the mouse 
apparently exhibits even more dichotomous activity cycles than the 
Tat. Working with male rats in diffuse activity cages, Hunt and Schlos- 
berg (43) attempted to determine the source of the 24-hour activity 
thythm; since castration did not abolish the periodicity they came to 
the conclusion that it was not based on the testes. Reversing the normal 
day and night periods caused the rats to shift their period of maximal 
activity in about a week. 

In constant light, the female rat will show peaks of activity which 
are correlated with a daily cool period (13). This is also true of blinded . 
Tats. The pre-oestrus activity readings come at the end of the 12-hour 
cool period. It had previously been shown that in a hot room, rats will 
reduce their activity slightly (13, 97). Below about 40 degrees Fahren- 
heit the rat’s activity is reduced also. 


Temperature 


5 The effect of temperature on the diffuse activity of two-day-old 
Mice was determined by Stier (111). These young mice are still poikilo- 
€tmic so that control of external temperature had direct effect on the 
ody temperature. It was found that different straight-line Arrhenius 
Plots would fit the several measures used: amount of activity, quies- 
cence, and frequence of occurrence. Thus, he concluded that activity is 


a function of several factors. 


Activity Following Confinement 


The drive for activity hypothesized in several behavior theories has 
got been fully substantiated. Shirley (95) reports increased running 
activity after a rat has been confined one to two days, but after ten days 
ol enforced activity, running decreases. Siegel (99) found an increase in 

€ number of interruptions of a light beam following close confinement. 
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It is important to note that in neither case is the amount of activity 
under conditions of no confinement significantly different from the 
increase reported after confinement. 

Skinner (100) has stated that “if any extensive activity is pro- 
hibited during part of a day, the remaining part shows a greater ‘den- 
sity’ of activity per unit of time.” Since he measured activity for a five 
to six hour period beginning at 3:00 a.m., it is possible that his conclu- 
sion is an artifact based upon the normal peak of the rat’s running ac- 
tivity which occurs at about 2:00 a.m. After this time there is a gradual 
decrease in the amount run, a rhythm which will persist in constant 
darkness. Without further work substantiating the generality of the 
principle of greater activity following inactivity, it would be parsimo- 
nious to ascribe Skinner’s results to the specific hours at which he re- 
corded activity. 


Miscellaneous Studies 


Other studies using the running drum which might be mentioned 
are: the correlation between activity and errors in learning a maze is low 
(89, 96); while that between activity and time to traverse mazes 38 
higher (68); rats given difficult discriminations run less than rats that 

. are not made abnormal by difficult problems (60); a series of electro- 
convulsive shocks greatly reduced voluntary activity in rats (112). 


NEURAL CONTROL OF SPONTANEOUS ACTIVITY 


The search for a neural center which controls activity has led to 
contradictory evidence. There is complete agreement that frontal 
lesions do increase activity and that bilateral lesions are more effective 
than uni-lateral lesions. The hyperactivity usually takes two tO three 
weeks to emerge. The animals on which most of the work has been done 
are rats, cats, and monkeys. 

The first observation of the effect of cortical lesions on activity has 
been attributed to several clinicians and experimenters. In 1920 Lashley 
(57) quantified the change in activity of the rat by using a running 
drum and noted that only frontoparietal lesions both increased the 
number of hours of running and decreased the time spent in resting: 
J acobsen (45, 46) noted increased restlessness and general activity fol- 
lowing frontal destruction in the monkey, and Langworthy and Kolb 
(56) described the behavior of cats with heightened restlessness. Since 
1937, considerable evidence has been gathered concerning the effect on 
spontaneous activity produced by lesions in the brain. 
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Rats 


In the rat, the unilateral removal of the frontal pole did not appear 
to augment running activity. The inactive rats became hyperactive 
while the active rats did not increase their running relatively so much. 
Bilateral ablation of the frontal poles was much more effective in in- 
creasing running than the unilateral (3, 83). Beach (4) measured run- 
ning for 30 days before and 50 days after electrolytic destruction of 
varying amounts of the corpus striatum. Activity increased postop- 
€ratively in one animal, decreased in two, and was unchanged in two 
others. ‘No relationship between the magnitude of lesion and effects on 
activity could be determined. In the rat, the striatum evidently does not 
exert a controlling effect upon running activity as measured in this 
experiment.” On the other hand, Richter and Hines (84) found that 
monkeys with unilateral striatal lesions immediately had greatly in- 
creased activity, and Mettler (64) reports hyperactivity in cats follow- 
Ing striatal lesions. 

. According to Mettler (63, 64) when the striatum is injured, hyper- 
kinesia is the rule. He asserts that the striatum is an inhibitory mecha- 
nism; “stimulation of it produces inhibition and removal of it 
engenders evidence of motor release. It stands on the one hand between 
the cortex and the final common path as part of the route through which 
the cortex may exert an inhibitory effect and, on the other hand, it 
Operates between the thalamus and lower motor mechanisms in the 
automatic inhibition incident to ‘unconscious activity’.”” If the cerebral 
Cortex is totally removed, the decorticated animal does not exhibit in- 
cessant activity but shows an inability to initiate or inhibit movement 
Suddenly (68). 


Cats 


Cats with bilateral one-stage removal of the rostral portions of the 
Cerebral hemispheres were noted by Magoun and Ranson (59) to be 
almost continually walking about. Langworthy and Richter (55) re- 
corded increase in activity from 27 to 61 units from unilateral operation 
and to 399 units for the bilateral removal of motor cortex, premotor 
Cortex, and possibly a small tip of the corpus striatum in cats. 


Monkeys 


In monkeys, Richter and Hines (84) found that bilateral removal of 
areas 8, 10, 11, and 12 had little effect on activity, while that of 9 did 
(62). On the other hand, Kennard and Ectors (50) reported increased 
activity following removal of area 8 alone. These results are not con- 


406 J. DAVID REED 


tradictory in view of the method of measurement of activity. Richter 
and Hines (84) attached the monkey by a chain to a short steel rod pro- 
jecting from an axle. Movement of the monkey caused the rod to ad- 
vance a counter. The method of recording activity used by Kennard 
et al. was a pneumatically mounted diffuse activity cage. ý 

Generally the activity builds up in the course of two to three weeks 
following an operation. Ruch and Shenkin (88), however, report that 
lesions in area 13 (of Walker) consistently produce hyperactivity within 
the second post-operative day. Richter and Hines (84) also report such 
immediate hyperkinesia when the monkey has striatal lesions. 3 

Kennard et al. (51) have stressed the visual role in hyperactivity in 
monkeys. ‘Hyperactivity is markedly affected by visual stimuli. It 
disappears in the dark or when the animals have been deprived of vision 
either by enucleation of the eyes of by bilateral lobectomy. ‘Absence of 
auditory stimuli has not the same effect.” 

A decrease in activity was noted by Barris (2) following “pilateral 
one-stage removal of the rostral portions of the neo-cortex of cats. 
Kennard et al. reported that “hypermotility in monkeys and chimpa??? 
T is related to lesions of the rostral portions of area 6 and to area 8 

51). 


SUMMARY 


The literature of the last twenty years concerning acti 
mals has been reviewed. The methods, results and concepts © 
have been summarized and appraised. 

Several methods have been in use: 


1. The running drum: this apparatus yields high reliabilit 
taken on a particular drum, but there are also inconsistencies 
to another and from one experimenter’s design of drum to another. , The 

2. The diffuse activity cage: a cage mounted on tambours or springs 
record which it gives varies widely from one cage to another. y ve 

3. Several miscellaneous mechanical and observational methods, which ha 
not been extensively used. 


vity in ani- 
f activity 


y for measures 
from one drum 


g activity 22 
e. The indi- 
ewhat com- 


There are considerable individual differences in runnin 
well as variability of.one animal’s activity from time to tim 
vidual differences are due in part to heredity, but are somev bé 
plicated by environmental influences. Intra-animal variability cam a 
ascribed to several factors: running increases during hunger and mo 
deprivations, during darkness and cool periods, and during oestrus: 


$ a 
ds of endocrine imbalance or deficiency, there is usually 


various kin ! 
running drum but 0° y 


decrease in activity—a marked decrease in the 
a small decrement in the diffuse activity cage. . 
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Running activity and diffuse activity are sometimes affected in the 
same way, sometimes differentially. Both reach a maximum during the 
cool or dark part of a 24-hour cycle. Some of the analeptic drugs stimu- 
late both kinds of activity, but other drugs may increase diffuse activity 


` while decreasing running activity. 


Injury to. the brain ‘affects activity. In particular, lesions of the 
frontal cortex heighten activity, and bilateral lesions cause a greater 
increase than unilateral injury. Still, it is not yet clear whether there is 
a specific activity center in the brain and, if so, where it is. 

There is now a very large body of data concerning animal activity, 
but it needs further definition and interpretation. Particularly needed 
is a clarification of the concept of activity in relation to the method of 
measuring it. Most treatments of the subject tend to regard activity as 
a single entity. Yet, in some cases, where comparable measures of ac- 
tivity are available from different devices, running drum and diffuse 
activity cage, the results are not the same. Activity, it would then ap- 
pear, does not constitute a single behavior category which can be meas- 
ured with any instrument but must be considered, for the present at 
least, in terms of its method of measurement. 
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SAMPLING IN THE REVISION OF THE 
STANFORD-BINET SCALE 


ELI S. MARKS 
National Office of Vital Statistics 


In another paper (4) the writer attempts to point out the biases 
which may arise through types of sampling procedure quite common in 
psychological research. The present analysis is devoted to another effect 
of sampling methods commonly used in psychology—namely, the sub- 
stantial increase in sampling error which results when “cluster” methods 
of sampling are used. It should be noted that this is not a criticism of the 
cluster type of sampling. Cluster sampling is an extremely valuable 
device and makes feasible many studies which would otherwise be com- 
pletely impossible. However, the use of cluster techniques implies sub- 
stantial modifications in our formulae for sampling error and psycholo- 
gists are, in general, not familiar with these modifications. Unfortunately, 
much of the important work in the field appears in sources which are 
relatively inaccessible to psychologists. Ignoring the effects of cluster 
sampling on measures of sampling error has undoubtedly resulted in 
attaching importance to results which are statistically insignificant. In 
the testing field, failure to allow for cluster sampling has probably 
caused us to attach a measure of precision to our results considerably 
in excess of that warranted by sound statistical techniques. 

Cluster sampling almost always involves an increase in sampling 
error as compared with unrestricted random sampling of the same num- 
ber of cases. It is, of course, possible to obtain a lower sampling error 
with cluster sampling than with unrestricted random sampling if we 
make up our clusters for this purpose. However, the main reason for the 
use of cluster sampling is to permit the sampling of previously existing 
groups (the clusters) and, in most cases, the use of a previously existing 
grouping of the population involves a positive intraclass correlation of 
the variable studied, i.e., our existing groups are almost always more 
homogeneous internally than groups of the same size made up by ran- 
dom selection of individuals from the population. It is the existence of 
Positive intraclass correlation which cuts down the amount of independ- 
ent information available from a cluster sample of a specified size and 
Occasions the substantial increase in sampling error usually associated 
with this sampling method. The present analysis is designed to empha- 
Size the substantial increase in sampling error which results from rela- 
tively small intraclass correlations. While this phenomenon is quite 
familiar to sampling statisticians, psychologists are rather generally 
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unaware of the marked disturbances of sampling error calculations and 
tests of significance introduced by the use of cluster sampling when a 
positive intraclass correlation exists. 

Although methods resembling cluster sampling are quite common in 
psychological research, very few psychological studies have used sam- 
pling designs which permit us to determine the standard error of the mean 
or of other sample statistics. As a matter of fact, it is difficult to find a 
study where analysis of the sampling error formulae used is not com- 
plicated by the presence of a non-measurable design (one in which the 
sampling probabilities are unknown). Some of the difficulties in the use 
of non-measurable designs are explored in a paper by McNemar (2) 
which discusses accidental sampling and purposive sampling as well as 
such measurable designs as unrestricted random sampling and stratified 
sampling. 


THE SAMPLING PLAN OF THE STANFORD-BINET 


The writer has, therefore, not attempted to find a study with a 


measurable design, but has selected for analysis the sample used in the 
revision of the Stanford-Binet. This sample has been selected for analy- 
sis principally because the widespread use of the revised Stanford-Binet 
makes the problems involved in its standardization extremely important 
in spite of the lapse of a decade since the revision was completed. The 
revision of the Stanford-Binet is also a good example for our purposes 
because (a) it was an extensive project, involving a relatively large 
number of subjects and the expenditure of considerable amounts © 
time, effort and money and (b) the purposes of the sample were OF 
plicitly formulated and clearly stated by the authors of the revis¢ 

Stanford-Binet. 

The reader should bear in mind that the present analysis is °” a 
“as if” basis. The Stanford-Binet sampling design does not yield statis 
tics with measurable standard errors and no amount of statistical MO 
nipulation can overcome this defect. The cure lies not in statistic? 
formulae but in more careful sampling techniques in future investig® 
tions. However, the use of measurable sampling designs in psychologic 
research will almost inevitably mean cluster sampling of some sort ana 
any other approach will be beyond the limited resources usually i 
able to psychologists. Thus, an examination of cluster sampling, €V°" 
on an “‘as if” basis, is extremely pertinent to the future of any psycholog 
ical research which involves statistical techniques. data 

In my analysis I have relied entirely upon statements and dat? 
published in Terman and Merrill (6) and McNemar (3). Since the 
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required for this analysis have not been published in full detail, I have 
been forced to use approximations at several points. Inquiry indicated 
that more detailed data could not be furnished without considerable 
expenditure of time and effort. Since the approximations used in this 
paper are satisfactory for purposes of illustration and since the sampling 
techniques used in the revision of the Stanford-Binet preclude a com- 
pletely accurate determination of error even if the detailed data were 
available, this deficiency is not serious. In nearly every case, the effect 
of the approximation used has been to understate the sampling error. 

In revising the Stanford-Binet, the major objective was to construct 
scales “so standardized for difficulty as to yield mean 1.Q.’s of approxi- 
mately 100 at all age levels” (Terman, in 3, p. 3). The authors of the 
revision realized that their success in this objective was dependent upon 
securing a measure of the distribution of test scores in the general 
population (or in a satisfactory sample of the population). The sample 
was restricted to “American born” subjects of the “white race” in the 
age range from 1} years to 18 years. Terman notes that “elaborate 
precautions were taken to make the sampling as representative of the 
entire population as circumstances permitted”’ (3, p. 6). 

According to Terman and Merrill this was done by selecting Sly) 
different communities in 11 states” (6, p. 12). They note that: “The 
selection of localities for the second year’s testing was based upon certain 
considerations in regard to sampling which had resulted from a study of 
the socio-economic level of the first 1500 subjects.” These considera- 
tions were what the authors viewed as an inadequate representation of 
the rural group and a difference between the occupational distribution ; 
of fathers of the cases tested and the occupational distribution of all 
employed U. S. males. In the second year’s testing, therefore, the au- 
thors of the Stanford-Binet revision “took care to include several addi- 
tional rural communities” (6, p- 14) Neither McNemar nor Terman and 
Merrill give further details on the method of selecting the communities. 
It séems evident that selection was not on the basis of random sampling 
(neither simple random sampling nor random sampling within strata). 
As a matter of fact the term “community” is not defined clearly enough. 
to permit a rigorous statement of the primary sampling units used. 
Nevertheless, we can visualize our population as being composed of 
“communities” (undefined but definable), so that the entire population 
of the United States can be broken up into a fairly large number (prob- 
ably over 3000) of communities. 

_ Within each community different procedures were followed for cases 
in the three age groups—13 to 54 inclusive; 6 to 14 inclusive; and 15 to 
18 inclusive. These groups were sampled as follows: 
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1. The group aged 6 to 14. Schools of “average social status” were selected 
in each community (method of selecting schools not further specified) an 
within each school all of the children between the ages 6 to 14 who were within 
one month of a birthday were taken, regardless of grade placement (6, p. 15). 
This sampling procedure is, then, a subsampling of subclusters with a 100 per- 
cent sample take within the subcluster. 

2. The group aged 15 to 18. Subjects were selected so that “the advanced 
group would be as nearly as possible continuous with the intermediate, with no 
break between fourteen and fifteen years. The compulsory school age was 
taken into account, the general character of the population, and the type ° 
secondary education that was offered. In each community the school census 
was consulted to determine the amount of elimination after age fourteen. e 
made certain that some of the twelve-, thirteen-, and fourteen-year-olds who 
had gone to high school were included, also some of the slow fifteen- and sixteen- 
year-olds who were still in intermediate school. A few cases who had graduated 
from high school were included and a few who had dropped out of school with- 
out completing high school. These out-of-school groups were sampled by choos- 
ing siblings of school children in numbers proportional to the amount of elim- 
ination at ages above fourteen.” (Sampling in this group was actually a roug 
type of “quota” sampling.) 

3. The group aged 1} to 5}. This group was sampled in much the same man- 
ner as the out-of-school cases in the group aged 15 to 18. The authors “chose 
as far as possible younger sibs of the school groups.” Children were secured by \ 
use of birth records, school census, school siblings, kindergartens, well baby f 
clinics, day nurseries, nursery schools and “personal report.” Use of the variou " 
sources differed from community to community. “Great care was exercise 
in the large population centers to include representative groups; if a school in @ 
suburban district which had been chosen as average on the advice of super!” 
tendent and counselors seemed to include too large percentage of higher cee 
pational groups it was offset by a tenement district center.” The authors state 
that “in the smaller communities, from seventy-five to eighty percent 0 aa 
pre-school child population of appropriate age was examined” (6). In the pu a 
lished tabulations results for children aged 14 were omitted and further refer 
ences deal only with the sample of children two years of age or over. 


It may be noted that the population sampled is limited to individ- 
uals within one month of a birthday (or half-year birthday for childre” 
under six). The population is also limited to American-born white 
persons and, in the age range six to 14, to children attending schoo 
These limitations do not affect the propriety of generalization from 2 
sample to the population so defined. The limitations may affect generi 
lization from the sample to all native-born white persons aged two 7 
18. This is not, however, of primary concern in this paper. Limitation 
on generalization resulting from the use of sub-populations are discuss ‘ 
by the writer in another article (4). For our present purposes, it 1$ su 
cient to accept the population, as defined. sas 

To summarize, the sampling plan of the Stanford-Binet revisio 
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involved: (a) sampling of “communities” from the aggregate of all 
United States communities; (b) the subsampling of schools for children 
aged six to 14 and taking all children (in the population as defined 
above) in the selected schools; (c) the subsampling of other members of 
the defined population from the “community” without any intermediate 
subsampling of schools but with the use of a rough type of “quota” 
Sampling. 


BIASES AND VARIANCE IN THE STANFORD-BINET SAMPLING 


The above is only an approximate statement since it is extremely 
hard to formulate exactly the sampling plan used. The method of selec- 
tion at each stage of sampling has not been specified above. It seems 
likely, however, that the sampling error of the plan used is greater than 
the error which would be involved in random sampling of “communi- 
ties” with equal probability of selection and no subsampling (i.e., a plan 
which would take all persons in the communities selected). 

It is obvious that a sampling plan not involving subsampling will 
have a lower sampling error for the same number of clusters sampled 
than a plan which did involve subsampling. The assumption that the 
community sampling actually used involved a larger sampling error 
than random selection is not as clear cut. Actually the sampling re- 
sembled “purposive sampling” or ‘quota sampling” but it does not 
appear to conform even to the rather loose requirements of these two 
techniques. 

In discussing purposive sampling Neyman (5) developed certain 
hypotheses which, if satisfied, would make the estimate commonly used 
in this method the “best linear estimate” (i.e., an unbiased linear esti- 
Mate with variance less than that of any other linear estimate). Neyman 
Notes that: 

If these hypotheses are not satisfied, which I think is a rather general case 
we are not able to appreciate the accuracy of the results obtained. Thus this is 
not what I should call a representative method. Of course it may sometimes 
give perfect results, but these will be due rather to the uncontrollable intuition 
Of the investigator and good luck than to the method itself. 


_ While the Stanford-Binet revision did not involve purposive sam- 
Pling, Neyman’s remarks are applicable to the sampling plan. Further- 
More, there is internal evidence in the results of the Stanford-Binet re- 
vision which indicates that, in spite of the purposive attempt to secure a 

representative” sample, the Stanford-Binet revision sample actually 
Produced a larger sampling error than would have resulted from random 


Sampling of clusters. 
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Table 3 below gives the number of cases from each of the communi- 
ties included in the Stanford-Binet sample. It should be noted that 37 
percent of the “urban” cases were drawn from San Francisco and 56 
percent of the “suburban” cases came from two California communi- 
ties. This means that 975 cases or 34 percent of the total sample were 
from California. In addition a disproportionately large number of the 
“rural” cases (41 percent) came from one community in Vermont. It 
would, of course, be possible to obtain clusterings in two states as 
marked as those shown by a random sampling of communities, but the 
probability of such an outcome is extremely small. It is almost certain 
that a random (or stratified random) sampling of communities wou 
have given a better geographic distribution (and undoubtedly a lower 
sampling error) than was actually obtained. This fact is also pointed out 
by McNemar (3) who expresses ‘‘skepticism concerning the represen- 
tativeness of these communities.” 

_ It should also be remembered that the authors of the Stanford- 
Binet revision felt very definitely that their results contained a substan” 
tial bias. As noted above, the primary objective of the revision was t° 
obtain a scale giving average I.Q.’s of 100 for each chronological age 
group. Terman and Merrill (6, p. 23) note that the mean 1.Q.’s for 
their age groups run “slightly above 100” and state that this ‘tis the 
result of intentional adjustment to allow for the somewhat inadequate 
sampling of subjects in the lower occupational classes.” McNemat 
(3, p. 20) states: 


The fact that the means in Tables 1 and 2 are above 100 should not lead the 


cupational status, the age means approach nearer 100, anda further adju 
aac rural representation would tend to bring the values still ¢ 
to . 


s for each 
pation 
except 


Table 6 on p. 36 of Terman and Merrill (6) gives average TO, 
age group “adjusted for 1930 Census frequencies of Occu 
groupings.” These averages still show substantial bias, all means s 
those for ages 4 and 54 being over 100 and seven age groups having 
average I.Q.’s over 103. The effect of rural-urban biasing influences 15 no’ 
presented. 

Since the method of correcting for bias is not stated, t 
these corrections on the mean square errors of the sample results ‘ 
be determined. It is probably not possible to make this determina 
y event since the presence or absence of biases in occupation 
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rural-urban distributions cannot by themselves tell us whether an 
1.0. distribution is biased or unbiased and correcting for rural-urban or 
occupational biases may have very little effect (or even an unfavorable 
effect) upon I.Q. biases. 

_ In any event, the original sample means of the Stanford-Binet re- 
vision contain substantial biases if the true population means are 100. 
These are shown by the figures in Table 1.* 


TABLE 1 
AVERAGE I.Q.’s By AGE GROUPS FOR THE STANFORD-BINET REVISION SAMPLE 


Age Groups 
24-54 6-13 14-18 All Cases 
From L—Mean 106.58 103.22 103.03 104.00 
Form M—Mean _ 106.42 103.96 103.32 104.43 
Number of Cases 728 1623 619 2970 


In view of the probable biases and the considerations with regard 
to the sampling method presented above, it is not at all unreasonable to 
assume that the Stanford-Binet revision sampling involved a larger 

‘standard error of the mean than would random selection of communi- 
ties with equal probability. Even if this is not the case, the subsampling 
volved should account for an increase in sampling error over a design 
in which there was no subsampling. 

On the basis of the above discussion, the standard error of random 
selection of communities with equal probability and no subsampling 
gives us minimum values for the standard errors of the Stanford-Binet 
Sample means. To estimate these errors, we shall assume that the num- 
ber of cases actually sampled in each community was the total eligible 
Population in that community. (As noted above, assuming that the 
community population was larger than the number sampled would lead 
toa larger estimate of the standard error.) 


* The data are from McNemar (3) Tables 1 and 2. There are minor differences be- 
tween the results presented by McNemar and those presented by Terman and Merrill 
(@pparently due to inclusion of some subjects in some of the distributions and their omis- 
sion in other distributions). The differences are minor and do not affect the present 
analysis. 

The data in this and in the two subsequent tables are reproduced with the permis- 
Sion of Houghton Mifflin Co., the publishers of McNemar'’s The Revision of the Stanford- 
Binet Scale. 
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THE STANDARD ERROR FOR CLUSTER SAMPLING 
The standard error for the type of sampling described (i.e. “cluster 
sampling’’) is given by: . 
M 
> Ne — #)? 
E l (1) 
(M — 1)m MN? 
Or, when we estimate gz from the sample, the estimated standard error 
is given by: 


oz”? 


M-—m Xu Nela: — #')? 0) 


Mm (m — 1)(N")? 


sz? = 
where M =the total number of clusters (communities) in the population 
m=the number of communities sampled 
N;=the number of individuals (eligible for the popula 
the i-th cluster 
%;=the mean I.Q. for the N; individuals in the i-th clu 
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larger than m (at least 100 times as great). Consequently, we can, with- 
a appreciable error, take M-—m/M equal to 1. With this substitution 
he square of the standard error is approximately equal to: 


LN: — 2)? Nea w)? 
z m 7 


mlm — 1)(N’)? m—1 ( = ny 


All the data required for Equation [3] can be obtained from the 
Pia Unfortunately, not all of the sample data are available in pub- 
ished form. Since we shall have to rely on published data, some further 
approximations (described below) are necessary. The approximations 
also act to reduce our estimate of the standard error. 

. McNemar (3) gives, as Table 9, information on the average I.Q.’s for 
children in “urban,” “suburban” and “rural” communities by age 
8roups. This table, plus data for the entire group in the age range 2 to 
18, is presented in Table 2. The data for the entire group were calculated 


from the information given for the three age groups. 


TABLE 2 
1.Q. DATA FoR URBAN, SUBURBAN AND RURAL CHILDREN* 


[3] 


se = 


Urban Suburban Rural Total 

2-5} Year-Olds 

Number 354 158 144 656 

Mean 106.3 105.0 100.6 104.7 

S.D. 15.7 16.1 15.4 15.9 
6-14 Year-Olds 

Number 864 537 422 1823 

Mean 105.8 104.5 95.4 103.0 

S.D. 14.7 16.8 15.5 16.1 
15-18 Year-Olds 

wasiber 204 112 103 419 

Mean 107.9 106.9 95.7 104.6 

S.D. 16.5 15.7 15.9 16.9 
All Ages (2-18) 

Number 1422 807 669 2898 

Mean 106.2 104.9 96.6 103.6 

S.D. 15.2 16.5 15.7 16.2 


* Denver 2- to 5}-year-olds are excluded. 
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the communities classified as “urban by McNemar; (b) the average 
LQ. for suburban children for each of the communities classified as 
“suburban” and (c) the average I.Q. for rural children for each of the 
communities classified as “rural.” This approximation ignores all varia- 
tions between communities within the urban, suburban and rural groups 
of communities. As a result the values of sx which we obtain should be - 
equal to or less than the values which would be obtained if we knew 
the means of each of the sampled communities.* 
There are some uncertainties in the published dat 
values of m and N;. As noted above, Terman and Merrill (6) stat 
17 communities were sampled in 11 states. This would give m= 
However on pp. 36-37, McNemar (3) lists the communities sampled an 
the number of subjects in each community. McNemar lists 7 urban 
communities. He also lists 3 suburban communities and states that, an, 
the suburban group, there were “four small communities just out © 
Kansas City in Johnson County Kansas, with 199 cases dra 
Westwood View, Hickory Grove, Roseland, and Shawnee 
schools.” For the rural communities, McNemar states: 
The samplings from rural communities include 85 from Mount 
School, Bullitt County, and Liberty School, Oldham County, 
total of 152 were drawn from the following districts of Indiana: Prather Sc 


Charlestown schools and Morgan Township School in Harrison County 4 


Galena School in Floyd County. A farming region at Bloomington, 
supplied 92 cases; the farming and small village community of Rando P 2 Vir- 
mont, provided 275; and 65 subjects were secured in the vicinity of At ° enta- 


ee aos : re 
ginia. We have already expressed some skepticism concerning the rep’ 
tiveness of these communities. 
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From this statement, it is difficult to determine the exact nu 


“rural communities” involved. At a minimum, there appear om- 
(assuming that schools in different counties represent different © 
munities). If we also consider the four schools in the “suburban. ting 
of Johnson County, Kansas, to be one community, McNemar § a dif- 
gives a count of 19 communities vs. Terman and Merrill’s e 

ference appears to be one in the definition of community. ie nities 
independently selected areas, Terman and Merrill’s 
is probably more nearly correct. However, the data in Table d 
on McNemar’s classification. It appears desirable to adopt McNemat 
ise, counting as communities the cities and towns listed by ; 


a ti 
er certain circum 


es 
he circumstar di jdual 


* This statement cannot be made absolutely since, und 
iv! 


may be incorrect. However, it is a fairly safe statement since t 
would give 2 higher standard error through substituting group averages 


averages are extremely unusual. 


See 
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This is the same basis we used 
oned above. Since the 
robably Terman and 
ates in the same 


plus any schools in separate counties. 
In getting the count of 19 communities menti 
count of independently sampled communities is p 
Merrill's figure of 17, this handling of the problem oper: 
direction as the other approximations previously made. 

A The difficulty in determining W; occurs in the cases where McNemar 
gives one figure for the number sampled in two different counties (e-g-, 


TABLE 3 


NUMBER or CASES SAMPLED IN EACH COMMUNITY AND 
ESTIMATED AGE DISTRIBUTION 


p 2-5} 6-14 15-18 All Ages 
eS Vear-Olds Year-Olds Year-Olds (2-18) 
Urban 
1. Denver, Col. 28 67 16 111 
2. Minneapolis, Minn. 46 111 26 183 
3. New York, N. Y. 12 29 7 48 
4. Reno, Nev. 28 68 16 112 
5. Richmond, Va. 46 114 27 187 
6. San Antonio, Texas 63 155 36 254 
7. San Francisco, Calif. 131 320 76 527 
Suburban 
8. White Plains, N. Y. 31 107 22 160 
9. Redwood City, Calif. 26 89 19 134 
10. Los Gatos, Calif. 62 209 43 314 
11. Johnson County, Kan. 39 132 28 199 
Rural 
12. Bullit County, Ky. 9 27 7 43 
13. Oldham County, Ky. 9 27 6 42 
14. Clark County, Ind. ahii 32 8 51 
15. Harrison County, Ind. 11 32 8 51 
16. Floyd County, Ind. 11 31 8 50 
17. Bloomington, Minn. 20 58 14 92 
18. Randolph, Vt. 59 174 42 275 
19. Atlee, Va. 14 41 10 65 


85 cases from Bullitt County, Kentucky and Oldham County, Ken- 
tucky). These cases can be handled by distributing the cases equally 
ogg the counties involved. This adjustment also operates to reduce 

he estimated standard error. A further approximation is necessary to 
Set standard errors for the means of each of the three age groups in Table 
2. McNemar gives only the total number of cases in each community 
and does not give the distribution of these cases among the age groups. 

O estimate the standard errors for the separate age groups, the number 
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of cases for each of the communities was distributed by age proportion- 
ately to the age distribution in the class (urban, suburban or rural) in 
which the community falls. The number of cases in each community 
shown by McNemar and the calculated distribution of these cases by 
age groups is shown in Table 3. This adjustment affects only the esti- 
mates of the standard errors of the age group averages and not the 
standard error for the entire group aged 2 to 18. 


COMPARISON OF CLUSTER SAMPLING ERROR WITH 
UNRESTRICTED RANDOM SAMPLING ERROR 


With all the adjustments reducing the standard error which have 
been made, it may seem surprising that we have any error left. How- 
ever, a fairly substantial amount of sampling error remains. Table 
shows the standard errors of the mean I.Q. calculated as described 
above (using Equation 3) compared with the standard error obtaine 
by the formula usually used in psychological research studies, 1.¢- 


A 4 
ana m 
where 
m Ni 
> Oa- #)? g 
PONOS. DE [ ] 
o = V 
and 


N=} N: 


alue (1.Q- 


In Equations [4], [5] and 6 , xij stands for the v 
q [4], [5] and [6], i ds A a 


jth individual in the ith cluster (community) and t i 
have the meanings previously defined. Equation [4] reptesen mi 
standard error of the mean of a sample drawn by unrestricted m t the 
sampling from an infinite population (i.e. a sample drawn so 
probability of drawing any observation in the population a 
and independent of the probability of drawing each of the © 
vations). 
It will be seen from Table 4 that the absolute values of 
errors calculated by Equation [3] are not large. There is a tf 2,89 
er cent in the average I.Q. for the entire group 


error of only 1 p ape 3 cen 
cases. However, a very substantial difference exists betwe 
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s of the standa" 


(£0 


À 


` Cm 


SAMPLING IN THE REVISION OF THE STANFORD-BINET SCALE 425 


ard error by Equation [3] and the standard error by Equation [4]. If 
we apply Equation [4] to determine the standard error of the mean ofa 
cluster sample, it is obvious that we shall be very far from the correct 
value (in this case we would get an error which is less than one-third of 
the correct figure). 

__This fact is extremely important in applying tests of significance to 
differences of sample means. For example, suppose we took a sample 


TABLE 4 


ESTIMATED STANDARD ERRORS OF THE MEAN I.Q.’s FoR CLUSTER 
SAMPLING AND UNRESTRICTED RANDOM SAMPLING 


Standard Errors Ratio of S.E. of Cluster 
Age G : Unrestricted Sampling to S.E. of 
pereues Chister Somers Random Sampling Random Sampling 
25} years 60 62 97 
ae: years 1.09 .38 2.89 
5-18 years 1.35 82 1.63 
All Ages (2-18) 1.01 30 3.36 


of 900 children aged 2-18 (by a method which was actually random) 
from some city or other population subgroup.’ Assume that this sample 
ives us an average I.Q. of 105.7 on the revised Stanford-Binet and our 
Sample has a standard deviation of 18, so that the standard error of the 
Mean (using, quite properly, Equation [4]) is .60. Our group has a 
mean 2.1 points above the average of 103.6 for the Stanford-Binet re- 
Vision sample shown in Table 2. We want to know whether this dif- 
erence is significant. If we assume unrestricted random sampling of 
the Stanford-Binet revision sample, we would use .30 (see Table 4) as 
the standard error of the revision sample mean. This would give us .67 
as the standard error of the difference of 2.1 and our difference would 
e 3.1 times its standard error. We would undoubtedly consider this a 
Significant difference. Actually, the standard error of the mean of the 
revision sample is at least 1.01, which makes the standard error of the 
difference 1.17. The difference is actually only 1.8 times its standard 
rror and can hardly be considered significant. 
é The sample used for the Stanford-Binet revision is not an extreme 
ase of the error which can be made by applying formulae based on un- 
restricted random sampling to data obtained by cluster sampling. The 
Sampling for the Stanford-Binet revision did involve testing individuals 
Tom several communities and the standard error for cluster sampling is 
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only 3 times the error for random sampling of the same number of in- 
dividuals. Many studies use data from one or two groups (e.g.) elemen- 
tary psychology classes in two neighboring colleges) to draw conclusions 
about the whole population (all college students or even all human 
beings). In this case the standard error obtained from Equation (3] 
may be 50 to 100 times greater than that obtained from Equation [4 
Use of the “correct” formula (‘‘correct” if we have used a random 
„process for drawing our groups) will make supposedly significant dif- 
ferences vanish more rapidly than a quart of ice cream at a children $ 
party. 


INTRACLASS CORRELATION 
or un- 


The reason for the difference between the standard error f 
restricted random sampling and that for cluster sampling is to be fete 
in the fact that individuals are not sampled independently in cluster 
sampling. If we consider samples of equal size from the same population, 
the standard error of the mean in unrestricted random sampling 6 
multiplied by approximately (1-+Np) when we use cluster sampling: 
Here N is the average size of our clusters and p is the intraclass correla- 
tion (a measure of the extent to which individuals within a cluster ie 
semble, or are “correlated” with, each other). The intraclass correlation 
usually ranges from 0 to +1 (although it can be negative). Tt can bẹ 
seen that even very small values of the intraclass correlation (say Or 
cluster sampling if the clusters are moderately large (N =100 or more’. 
As a matter of fact, the estimated intraclass correlation for th 
sample (all individuals aged 2 to 18) used in the Stanford-Binet rev", € 
is only .08. A recent paper by Walsh (7) gives some of the proba wet 
considerations involved in tests of significance when intraclass correla 
tion is present. ğ 
Pace is one feature of Table 4 which may arouse SO 

is is the fact that the estimated standard error (using Equat 
of the mean I.Q, is larger for the group aged 6-14 years than 
group aged 2-5} years. This is, of course, contrary to what we aon) 
expect from the description given of the sampling process. 
extent this peculiarity results from our ignoring subsampling 
lating the standard errors. Consideration of subsampling varia 
would probably increase the standard errors somewhat and would pr? 
ably increase the standard error more for the group aged 2- pye r 
than for the group aged 6-14 years (since there are fewer of the yopi 
children). As a matter of fact, inclusion of subsampling variation M! 


me interest: 
jon ) 
for the 


X 
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double the standard error for the mean I.Q. of the group aged 2-53 but 
would probably not increase the standard error of the group aged 6-14 
more than 10 per cent 

Actually Table 4 shows a lower standard error of cluster sampling 
for the group aged 2-5} years than for the group aged 6-14 years be- 
cause there is less variation among the average 1.Q.’s of the urban, 
suburban and rural children for the younger group. This fact may be 
due to some basic relation between I.Q. variability and age. For ex- 
ample, McNemar (3) gives a table for adjusting 1.Q.’s for differing 
standard deviation of the I.Q. at various ages. He bases this table on 
the differences actually found in the sample. d 

Another explanation of the differences in variability between age 
groups is to be found in the selective nature of the sampling for the 
Stanford-Binet revision. Selective sampling seems to have been par- 
ticularly important in the pre-school group. In another article (4), the 
present writer points out some effects of selective sampling on group 
means and also notes that selective sampling will usually affect the 
standard deviation also. It would be very unwise to hypothesize about 
the difference between age groups shown in Table 4 unless we had much 
more information about the sampling probabilities. 

This article has used the Stanford-Binet only as an illustration of 
the dangers of ignoring the intraclass correlation when we are dealing 
with a cluster type of sampling. In view of the qualifications placed on 
our analysis, it is not possible to draw any conclusions about the relia- 
bility or unreliability of the revised Stanford-Binet as a measuring in- 
Strument. There may be good reasons for supposing that the precision 
of the revised Stanford-Binet is considerably less than many of its users 
assume, From the sampling standpoint, the sample design used in the 
revision of the Stanford-Binet was a non-measurable design and there 
is no way of telling how ‘‘bad” or “good” the results were. It has been 
Suggested that the sampling errors shown in Table 4 are probably mini- 
mum figures. However, the results do offer a possibility of improving 
the sampling design in the event that the Stanford-Binet is revised 
again in the future. An error of 1 I.Q. point in the average 1.Q. may 
Not be too serious. If this is the case, the biases in the Stanford-Binet 
Average I.Q.’s could probably be removed by using a sound sample 

esign without any need for an increase in either the number of com- 
Munities covered or the number of subjects tested. If greater accuracy 
than a mean correct within 1 per cent is considered necessary or de- 
Strable, this could probably be achieved by increasing the number of 
Communities sampled without increasing to any great extent the total 
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number of subjects tested. As a matter of fact, increasing the number 
of subjects tested would probably add very little to the accuracy of the 
final results (at least for the age group 6-14 years). The standard error 
of a mean in cluster sampling decreases (approximately) in proportion 
to the square root of the number of clusters sampled. The standard 
error shown in Table 4 for unrestricted random sampling is .3 of an 1.Q 
point. The standard error for cluster sampling is 3.36 times this value. 
Therefore, to get a standard error of .3 using cluster sampling, we woul 
need about 11 times as many communities or about 200 communities: 
This estimate of the number of communities required is, of necessity, 
unreliable, since we were forced to estimate our standard errors from E i 
sampling plan which is actually non-measurable, and measuring the 
non-measurable puts an obvious strain on epistemology. 4 

In designing a sampling plan for a revision of the Stanford-Binet 
recent developments in sampling theory and practice can be usg a 
increase accuracy without increase in survey costs. The reader $ (i) 
tention is directed particularly to the work of Hansen and Hurwitz ce 
in this field. Using the techniques developed by Hansen and Huns 4 
persons revising the Stanford-Binet would probably get satisfaco i 
precision from a well-designed sample of 25 to 100 communities wi 
only a very small increase (if any) in the total number of cases teste 


SUMMARY 
Jation 


This article stresses the dangers of ignoring the intraclass corre The 


of the population when “cluster” methods of sampling are usec. aa 
increase in sampling error resulting from cluster sampling 15 demia 
strated by an analysis of the results of the sample used in the reva 
of the Stanford-Binet. This sample actually yields “non-measur® r 

results, i.e. results which do not permit determination of the en, 

error of the sample mean. However, it is estimated that the stan nic 

error of the average I.Q. of this sample is at least 3 times the error vA 
would be calculated by the use of the formula for unrestricted ran fil? 
sampling from an infinite population. The latter formula 1s OP ieee 
familiar to psychologists and the one usually used by them rego ei , 
of the type of sampling involved. The illustration indicates that te 
substantial errors may result from.this practice and that many T 
will be considered statistically significant where such a conclus! 

entirely unwarranted. 
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APPENDIX 


_ Although the formula for the standard error of the mean for cluster sampling 
1S not new, psychologists are generally unfamiliar with it. The derivation of 
this formula is, therefore, presented below. The s transformation will be found 
useful in deriving standard errors for more complicated designs (e.g.-designs 
Using stratification, subsampling, differential sampling probabilities, etc.). 
Equation [2] gives the mean square error (square of the standard error) of 


the mean of a cluster sample as: 


\ 


Naa — #)? 
M-m 2 


im Aena 


The mean of the sample is: 


m 
D Niži 
g = X f 


UN: 


te ae m, Ni, ŝa 2 WV’, «and WV are defined on p. 420 It is also convenient 
enne: 


xij = value for jth individual in ith cluster 


., = sum of the values for all individuals in ¿th cluster. 


From their definitions, it can be seen that: 
Xi 


N; 
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and, therefore 


m 


È si 


i 


LN: 


. e . rva- 
#' can be treated as a ratio of two linear functions of the sample obse 
tions, namely: 


M > M = Ni 
f(a) =— s =— L ty 
m i m i i 
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M 
KN) =—D N= 
m q m 


m Ni 
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In deriving sz, it will be useful to prove the following theorem: 


Theorem: If we have a sample estimate: 


i Xp an 
where f(x) and f(y) are linear functions of the sample observations 


ya(h=1, 2+ ++ n) and if: 
c= Bs), y= BO BET 


then: 


f 

i eter ° 
where ov? is the mean square error of r’ and r is the population para 
which 7’ is an estimate. mean sq d 

Proof: When we have a sample estimate r’ =f(x)/f(y), the ound * ane 
error of r’ can be found by: (a) expanding 7’ as a Taylor series ae Sal off 
y (the expected values of f(x) and f(y); (b) subtracting 7 (the tru Ae pot? 
for the entire population) from both sides of the equation; (c) re of the 1° 
sides of the resulting equation and (d) taking the expected val sl derivat” 
sultant. If we ignore, in our Taylor series, terms involving partia 
higher than the first, the result of this operation will be: 


op? = E(r’ — r)? 


x 2 ENa? w amity: y 
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If we let 
Eh Yh 
Zh = za ii y 
and if f is a linear function, then 
-i JO) 
f@) = ; 
and 
a Ef) _ * %_9 
go- ES - 
_ f(z)" 4 orn? _ onas 
Ofe) =a Ea ey 
Therefore: 
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theorem ca be appl ' 
e p: 


follows: 
_, _ (2) 
770) 
where . 
f(x) - Èa or f(x) -ZS En 
m Ni 
Ms =— Ni; where Ny = 1 
MeN 1) >>> i 
and 


Ef(x) = 2 si =% 


M 
EN) = UN =N. 


432 ELI S. MARKS 


Let 
Xi Ni 2 ij 1 
zi = —— — of zj; = — 0 TS 
x N 2 N 
M m m 
DDE =N: 
M, m m i 
JOÐ =— Dox = 
m i x N 


By Equation [11]: 
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Since f(s) is M/m times a sum of the sample values 3: 
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= Ef(z) = 0. 
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An unbiased estimate of sqn? from the sample is: 
z; — 7)? ? 

ney 7" aon 

Mm m—1 À 


* The result is the same whether the z transformation is applied to the cluste 
or the individual observations. 
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where 


Se De EM 


m mX mN 


From Equations [13], [14], and [16] we have: 


7 
D (@& — 2)? 


w= (=) A un 
: v/ (M—1)m M 
and 
T: 
E ye — m) 2 @ ) J [13] 
2 N Mm m—1 


In Equation [17] we substitute the values: 


x; N: » = i 
z = — — — z=0 z 
j x N : ‘ M 
and get: 
x 2 
Ciad 
a (M-m “í $ [19] 
j (M — 1)m MN? 
or 
5 
N2(&: — 2)? 
De M.— m i [20] 
d (M — 1)m MN? 


We make the same substitutions in Equation [18] and also substitute for 
®and W the sample estimates: 
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and 
N: 
y-t ki 
M m 


This gives: 


> M—m Mele ia [21] 


Mm (m — 1)(N’)? 


or 


>; N24: = x)? 


gis M— m m [22] 
M m=i n 2 
> Ni 
, i 
? F oiia into the 
In some cases, cluster sampling may introduce a substantial bias 5 of the 


sample standard deviation (when the sample S.D. is used as an esum? mater 
population S.D.). This bias will be practically eliminated by use of thee 
sè = oe + sg? 


where c, is the sample S.D. and s, is an estimate of the population SD. from 2 
Equation [23] can also be used for estimating the population >: * 
sample with unrestricted random sampling. 
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ILLUMINATION STANDARDS FOR EFFECTIVE 
AND EASY SEEING 


MILES A. TINKER 
University of Minnesota 


, The problem of artificial illumination is of primary importance in all 
inside working environments. To maintain healthful and efficient func- 
tioning of the eyes, it is necessary to provide adequate lighting. Un- 
questionably, proper illumination contributes much to comfort and 
efficiency in activities of daily life. Working under faulty illumination 
frequently results in eyestrain which tends to be accompanied by reflex 
functional disturbances of other organs. 

During recent years a “lighting consciousness” has been forced upon 
a large portion of the population, particularly upon those who do con- 
siderable visual work under artificial light and upon those who must 
decide upon the illumination requirements of schools, offices, factories 
and other situations where visual work is to be performed. Although 
interest in lighting has been stimulated by popular articles, advertise- 
ments, and “educational pamphlets” —as well as by reports written by 


educators and medical men—the more fundamental information has 


appeared as experimental reports in scientific publications. The result 


of exposure to this material is a keen interest in illumination and a 
sincere desire on the part of the public for sound information concern- 
ing hygienic lighting. The natural tendency is to consult pamphlets on 
recommended practice when lighting specifications are needed for a 
particular situation. Frequently, the applied psychologist will be called 
upon to furnish advice on proper illumination. In many instances he 
will be asked to evaluate the materials presented in the recommended 
uently, the applied psychologist should be informed 


practices. Conseq 
hich the lighting specifica- 


concerning the adequacy of the data from w 
tions in the recommendations are derived. 
The first code on lighting was issued by the Illuminating Engineering 
Society in 1915. In the more recent publications, the codes are known 
as Recommended Practice of Home Lighting, of Office Lighting, etc. 
These pamphlets have been prepared by the Illuminating Engineering 
Society either alone or jointly with the American Institute of Architects, 
usually under the rules of procedure of the American Standards As- 
Sociation, Although the American Psychological Association has been 
in existence for over 50 years, and even though applied psychologists 
have been interested in the field and have been making experimental 
contributions to the hygiene of vision for over 40 years, neither psy- 
435 
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chology nor psychologists are represented in the group specifying recom- 
mended practices. Furthermore, a large body of psychological literature 
has been ignored, either because the illuminating engineers were not 
familiar with it or because they chose not to use it. The result has been 
an emphasis upon the engineering aspects of lighting with inadequate 
attention to certain psychological factors. More recently there has been 
some attempt to consider more of the psychological factors. Perhaps 
because engineers lack a psychological background, interpretations are 
frequently erroneous. Probably the most satisfactory approach to hy- 
gienic lighting could be achieved by coordinating the contributions of 
engineers, physiologists, and psychologists. 

Recent editions of recommended practices reveal an increased em- 
phasis upon control of direct and reflected glare, brightness contrast, 
and the diffusion or distribution of light. The tendency to specify rela- 
tively very intense light for many visual tasks is prominent. The pur- 
pose of this paper is to present a critical examination of the specifications 
in the more recent editions of recommended practices and to scrutinize 
some of the data from which the recommendations were derived. 


SPECTRAL QUALITY OF LIGHT 


In general, spectral quality of light receives adequate treatment H 
recommended practices (35, 36, 37, 38). It is stated that with equa 
foot candles of illumination, variations in color quality of light found E 
common illuminants have little or no effect upon the visual discrimina 
tion involved. When color is to be discriminated, it should be v 
under as close an approximation of daylight as possible. Luckiesh 
has a valuable discussion of light and color. 


QUALITY OF LIGHTING dif 
i 


Recommendations (35, 36, 37, 38) concerning control of glare, ad 
fusion, direction and distribution of light, light reflection value, E 
effects of finishes on ceilings and wall are ordinarily quite satisfactory 
Visual discrimination is improved by moving the glare source away 4 aa 
the line of vision and by reducing the brightness of the light source “es 
the amount of light emitted by the light source toward the eye. Bre ts 
ness of luminaires should be low in value. High brightness conte 
within the field of vision should be avoided whether on the work sce T 
or in other parts of the visual field. Proper diffusion of light nen itt 
eliminate undesirable shadows. Purely local lighting, therefore, ots 
satisfactory. Since the reflection factors of objects in the visual env 
ment play an important role in illumination, the finish of ceiling 


a walls, 


i 
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floors and furnishings is important. These surfaces should provide 
reflecting surfaces to help spread the light about the room. Further- 
more, they should be such that undesirable brightness contrast does not 
Occur within the field of vision. Shiny or glossy finishes should be 
avoided to prevent specular glare. i 

_ In the recommended practices, informative discussions on classifica- 
tion of lighting systems are usually included. Also illustrations of fix- 
tures and installations are sometimes given. Some attention is given 
to daylight illumination and the need of coordinating artificial with day- 
light lighting, 

INTENSITY OF ILLUMINATION 


Intensity of illumination receives by far the greatest emphasis in 
Specifications. With each revision of a lighting code prepared by il- 
luminating engineers, the foot candle recommendations for a given situ- 
ation rise. One may well question whether this trend has a scientific 
basis, or whether the consumer has been educated to accept the higher 
intensities. In 1934, Luckiesh and Moss (11) presented general recom- 
mendations which they considered to be very conservative. These are 
Tepeated with slight changes in Luckiesh’s 1944 book (10). He adds 
that these are inadequate in many cases where hundreds and even 
thousands of foot candles of light are desirable. Examination of the 
recommended practices of lighting reveals that, for the most part, they 
are based upon researches done and interpretations made by Luckiesh 
and his co-workers, or upon researches inspired by them. Let us turn 
first, therefore, to these reports. 

In Light, Vision and Seeing, Luckiesh (10), and in the New Science 
of Seeing, Luckiesh and Moss (11), make the following foot candle 
recommendations for common tasks of the work-world: 

1. 100 foot candles or more are specified for severe and prolonged visual 
Work. Examples include fine needle work, pen work, engraving and assembly, 


and discrimination of fine details involving low contrast. 
2. 50 to 100 foot candles should be used for proof-reading, difficult reading, 


Watch repairing, and average sewing. i 
3. 20 to 50 foot candles are listed for such visual tasks as clerical work, 


Ordinary reading and average sewing on light goods. 
4. 10 to 20 foot candles are proposed for ordinary reading and sewing on 


light goods when the task is not prolonged. 
5. 5 to 10 foot candles are needed for visual work which is more or less inter- 


Tupted or casual. 
6. 1 to 5 foot candles are sufficient for perceiving large objects. 


Luckiesh (10) states that these are minimum foot candle recommen- 


dations and that he considers them to be very conservative from the 
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viewpoint of ease of seeing. Furthermore these foot candles, according 
to Luckiesh and Moss (11), are far below the intensities of illumination 
which new knowledge indicates to be ideal. . 

These recommendations are derived from various sets of data which 
will be discussed in turn. 


Preferences for light intensity. Luckiesh and Moss (11) cite cate 
preferences for light intensities to support their contentions that high 
intensities are necessary for adequate seeing. The mean choice was 
about 100 foot candles but the median was 50 foot candles when up to 
1000 foot candles were available. Tinker’s analysis (22) of light eon 
ence studies indicated that visual adaptation plays an important De 
in determining the preferences. In an experimental check, Tinker (2 i 
found that when readers were adapted to 8 foot candles, the media! 
choice for comfortable reading was about 12 foot candles. But when 
adapted to 52 foot candles, the median choice was 52 foot candles. It is 
obvious that the intensity of illumination to which the reader is ma 
plays a dominant role in his illumination preference. The wae i 
therefore, that preference for illumination intensity is not a satisfac! a 
method for determining the intensity of light needed for efficient v15' 
work, i 
Visual acuity. Luckiesh and Moss (11) and Luckiesh (10) AP 
visual acuity as a basic factor in reading (and presumably in O A 
visual work). Itis true enough that visual discrimination does depri 
somewhat upon visual acuity. But is visual acuity an adequate ree 
for prescribing appropriate lighting? Luckiesh and Moss (13) acm” 
that in many tasks the criterion of visual acuity is relatively eet 
propriate, e.g. in tasks involving low contrasts. But they point out * 
for black test objects on a white background, visual acuity poe cae ES 
to 100 foot candles. As a matter of fact, Lythgoe (15) has shown 
under certain conditions of measurement, visual acuity improves he 
and beyond 1000 foot candles. Inspection of the data reveal tha ial 
knee of the curve of improvement is at about 10 foot candles an in 
beyond about 20 foot candles the gains are slight. It must be kep old 
mind that in measuring visual acuity, one is dealing with ehren Ee 
values. Itis highly questionable whether the almost microscopic 8 li- 
in visual acuity obtained under the high foot candles justify their api 
cation to visual tasks where supra-threshold visibility is involved EE 
most everyday situations. Furthermore, data reveal that the v!S 
acuity curve is practically horizontal from 50 foot candles to the hig 
levels. seal acu” 

Luckiesh and Moss (11) and Luckiesh (10) cite data on visua Be 
ity for 1, 10, and 100 foot candles only. If they really desired to fin 
foot candle level beyond which no practical gains in visual acuity 
they should have investigated the range between 10 and 100 foot 


curls 
can- 
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dles. As shown in Tinker’s reviews (29, 31), this criticism may be aimed 
at all the basic data presented by Luckiesh (10). In some instances 
(decrease in heart rate, decrease in convergence reserve of ocular 
muscles), data for only 1 and 100 foot candles are presented. This pro- 
cedure is inexcusable in experiments designed to determine how much 
light intensity is needed for efficient visual work. It appears, then, that 
visual acuity data are of only slight use for prescribing illumination 
intensities for visual discrimination in supra-threshold tasks. If ac- 
cepted, there is no justification for suggesting that more than 40 to 50 
foot candles are necessary for adequate discrimination even for tasks 
that approach threshold discrimination. 

_ Visibility measurements. Luckiesh (10) states that “After establish- 
ing a standard of visibility or desirable see-level to be attained if pos- 
sible for all tasks, it is seen that specifications of light and lighting and 
other aids to seeing can be based upon visibility measurements.” The 
measurements are to be made by the Luckiesh-Moss Visibility Meter. 
This is a device consisting of two identical circular gradients which are 
rotated before the eyes to alter the brightness contrast of the object 
whose visibility is to be measured. It, therefore, reduces the object to 
threshold visibility. It is the threshold which is measured. Three as- 
sumptions are made: (a) Two objects are equal in visibility when both 
are barely visible, (b) “Two objects are equally above threshold visi- 
bility when their visibility has been increased by the same increase” in 
size, brightness, brightness contrast or time, (c) “The visibility of an 
object, or degree of supra-threshold visibility, is proportional to the 
decrease in any one of the fundamental factors necessary to reduce the 
object to threshold visibility.” These assumptions are considered to be 
axiomatic and arguments against them are considered to be futile. Nev- 
ertheless, since recommended standards are based upon visibility 
Measurements to a large degree, it seems desirable to examine the 
Matter further. Things are not axiomatic just because some one says 
they are. 

Since visibility measurements are in terms of threshold values, they 
are analogous to visual acuity measurements. They are subject, there- 
fore, to the same criticisms as visual acuity measurements as criteria for 
Prescribing illumination standards. 

Luckiesh (10) emphasizes foot candles for equal visibility in pre- 
Scribing illumination intensities. For example, to make newspaper text 
Matter equivalent in visibility to 8 point book type on white paper under 
10 foot candles of light, it is necessary to use 30 foot candles. And to 
make the 1/64” divisions on a steel scale equal to this visibility level, 
180 foot candles are needed. Are these levels of illumination intensity 
required for efficient and comfortable seeing? Luckiesh (10) assumes 
that this is a conservative standard. On his empirical scale, the 8 point 
type with 10 foot candles has 48 per cent maximum visibility. (Maxi- 
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viewpoint of ease of seeing. Furthermore these foot candles, according 
to Luckiesh and Moss (11), are far below the intensities of illumination 
which new knowledge indicates to be ideal. F 

These recommendations are derived from various sets of data which 
will be discussed in turn. 


Preferences for light intensity. Luckiesh and Moss (11) cite amaron 
preferences for light intensities to support their contentions that high 
intensities are necessary for adequate seeing. The mean choice was 
about 100 foot candles but the median was 50 foot candles when up tO 
1000 foot candles were available. Tinker's analysis (22) of light prefer- 
ence studies indicated that visual adaptation plays an important role 
in determining the preferences, In an experimental check, Tinker (26) 
found that when readers were adapted to 8 foot candles, the median 
choice for comfortable reading was about 12 foot candles. But when 
adapted to 52 foot candles, the median choice was 52 foot candles. It is 
obvious that the intensity of illumination to which the reader is adapted 
plays a dominant role in his illumination preference. The conclusion 1$, 
therefore, that preference for illumination intensity is not a satisfactory 

. method for determining the intensity of light needed for efficient visua. 
work, y 

Visual acuity. Luckiesh and Moss (11) and Luckiesh (10) list 
visual acuity as a basic factor in reading (and presumably in other 
visual work). Itis true enough that visual discrimination does depen 
somewhat upon visual acuity. But is visual acuity an adequate criterion 
for prescribing appropriate lighting? Luckiesh and Moss (13) admi 
that in many tasks the criterion of visual acuity is relatively MAR 
propriate, e.g. in tasks involving low contrasts. But they point out tha 
for black test objects on a white background, visual acuity improves up 
to 100 foot candles. As a matter of fact, Lythgoe (15) has shown tha 
under certain conditions of measurement, visual acuity improves UP 
and beyond 1000 foot candles. Inspection of the data reveal that the 
knee of the curve of improvement is at about 10 foot candles and that 
beyond about 20 foot candles the gains are slight. It must be kept 19 
mind that in measuring visual acuity, one is dealing with threshol 
values. It is highly questionable whether the almost microscopic gains 
in visual acuity obtained under the high foot candles justify their appli- 
cation to visual tasks where supra-threshold visibility is involved as 1? 
most everyday situations, Furthermore, data reveal that the visua 
a curve is practically horizontal from 50 foot candles to the higher 
evels. 

Luckiesh and Moss (11) and Luckiesh (10) cite data on visual acu- 
ity for 1, 10, and 100 foot candles only. If they really desired to find the 
foot candle level beyond which no practical gains in visual acuity occur 
they should have investigated the range between 10 and 100 foot can- 
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Thieisa pas are to be made by the Luckiesh-Moss i ] 
Sotated ie Ray consisting of two identical circular gradients which are 
ioraa elore the eyes to alter the brightness contrast of the object 
thrash cpa! is to be measured. It, therefore, reduces the object to 
Sum old visibility. It is the threshold which is measured. Three as- 
ptions are made: (a) Two objects are equal in visibility when both 
es barely visible, (b) “Two objects are equally above threshold visi- 
ne when their visibility has been increased by the same increase” in 
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they are. 

Since visibility measurements are in terms of threshold values, they 
S analogous to visual acuity measurements. They are subject, there- 
ore, to the same criticisms as visua teria for 


prescribing illumination standards. 
Luckiesh (10) emphasizes foot candles for equal visibility in pre- 


Scribing illumination intensities. For example, to make newspaper text 
ioe equivalent in visibility to 8 point book type on white paper under 

0 foot candles of light, it is necessary to use 30 foot candles. And to 
make the 1/64” divisions on & steel scale equal to this visibility level, 
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required for efficient and comfortable seeing? Luckiesh (10) assumes 
that this is a conservative standard. On his empirical scale, the 8 point 
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mum visibility is obtained from a test-object whose critical detail has a 
visual size of 20 minutes; a critical detail of 1 minute is the smallest 
visible for persons with normal vision under 10 foot candles of light.) 
But no adequate experimental check is made for performance of these 
tasks under various levels of illumination. Tinker (27) found that the 
critical illumination level (the intensity beyond which no further change 
in reading performance occurs as the intensity is increased) for reading 
7 point newspaper type to be approximately 7 foot candles. It is difficult 
to conceive the need of going above 20 foot candles to provide a margin 
of safety above the critical level. It is highly probable that an experi- 
mental check will reveal that other visual tasks, like discriminating the 
divisions on a steel scale, do not require the 180 foot candles indicated 
for efficient vision by the computations of Luckiesh. Related to this is 
the question of comfortable vision. Harrison (8), in discussing the diffi- 
culty of using high intensities because of the introduction of glare fac- 
tors states “Visibility and comfort are two separate factors which do 
not always overlap completely.” ' 

No one will deny that visibility is an important factor in ease of 
seeing. But to prescribe standards in terms of scores derived from 
measurements made with the Visibility Meter is open to serious ques- 
tion. The basic data are threshold scores. While the derived scores may 
appear logical, supra-threshold seeing is not the same phenomenon a$ 
threshold seeing. Apparently, as illumination intensity is increased, One 
soon reaches a level of diminishing returns where further increase 1S © 
no practical importance or may introduce harmful factors from the 
viewpoint of easy and comfortable seeing. 

Nervous muscular tension.: Luckiesh and Moss (11, 12), place great 
stress upon the apparent decrease in nervous muscular tension during 
reading as the illumination intensity is increased from 1 to 10 to 
foot candles. Tinker’s (22) analysis of their data reveals that the metho 
employed to present their results magnifies minute differences $O tha 
they appear large. Interpolation shows only gradual changes from 
to 20 to 25 foot candles and very slight changes from there on to | 
foot candles. _The conclusions that high foot candles are needed for 
ordinary reading is not valid. In a comparable situation, Tinker (25) 
found that for reading 10 point type, the critical intensity was about 
foot candles. Below this level, rate of reading was retarded and fatigue 
increased, but for higher intensities there was no change. For peop © 
with normal vision, 10 to 15 foot candles should provide a satisfactory 
margin of safety for reading legible print. 

Frequency of blinking. Another favorite criterion employed by 
Luckiesh and Moss (11, 12) and Luckiesh (10) as a basis for prescribing 


illumination intensities for visual work is frequency of blinking. The 
typical experiment is to measure the rate of involuntary blinking for m 
first and for the last five minutes for an hour’s reading under 1, U» er 
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10 and under 100 foot candles of light. They note that the blink rate is 
greater under the 1 than under 10, and greater under 10 than under 100 
foot candles. Therefore, it is concluded that relatively high intensities 
are desirable for reading. Even if these data are accepted as valid, we do 
not know where between 10 and 100 foot candles the curve of increased 
efficiency flattens out since intermediate intensity values were not 
studied. But there are several sources of information which suggest that 
blink rate is not a valid criterion of ease of seeing: 


1. McFarland, Holway, and Hurvich (18), after a searching analysis of 
their own extensive experiments and of other studies, state: “A high blink-rate 
need mean neither an increase in fatigue nor an increase in diffculty of seeing.” 
They conclude that “the rate of blinking can hardly be considered as a valid 
index of visual fatigue.” 

2. Tinker (32), in a study that has some bearing on the subject, found that 
frequency of blinking is an inadequate criterion of readability of print. 

3. Bitterman (1), working with 3 and 91 foot candles of light, found that 
when subjects read for 40 minutes there was no significant difference in rate of 
blinking. In fact the frequency of blinking was slightly greater under the 91 
foot candles. Incidentally, Bitterman also found no significant difference in 
blink rate for reading large type vs. small type. His results, therefore, indicate 
that rate of blinking cannot be employed as an index of ease of visual work. 
Further studies by Bitterman and Soloway (2, 3) showed that frequency of 
blinking is unrelated to duration of visual work or to the presence of a relatively 
intense glare source in the visual field. The reports of McNally (19) and Mac- 
Pherson (16) also cast doubt upon the validity of blinking as an index of ease 
of seeing. 

4. The statistical treatment employed by Luckiesh:and Moss (11, 12, 14) 
upon their data is open to severe criticism. Tinker (28, 29) has questioned the 
appropriateness of the geometric mean which they employ in most comparisons. 

e same criticism is raised by Hoffman (9). In a searching analysis, Hoffman 
also severely criticizes the use of the percentage technique employed by Lucki- 
esh and Moss for presenting data, and for basing conclusions on percentage 
differences rather than on raw score differences. Percentage scores are notori- 
Cusly unreliable. Furthermore, if the raw scores are below 100 (as most of 
them are), percentages magnify the differences. When percentages are used, 
therefore, ‘the observed differences may be largely an effect of the derivation. 

Nsignificant raw score differences may seem large when put into percentages. 

or instance, a typical average of 30 blinks during 5 minutes of reading is in- 
Creased 10 per cent by a change of 3 blinks. Hoffman further points out that 
work decrement may be a more important variable than illumination changes 
in the results of Luckiesh and Moss. In general, he found little support for the 
Contention that relatively high intensities are needed for effective and easy 
Seeing, 
_ 5. Eames (5) criticizes Luckiesh and Moss (14) for using relatively few sub- 
Jects in their experiments (including blink rate studies) and for employing 

test wise” subjects. As pointed out by Eames, “People who take tests re- 
Peatedly in a given field gradually learn what is expected of them” and are un- 
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" ; ch con- 
intentionally influenced by this knowledge. Results obtained under a 
ditions cannot be representative of the reactions of the general popu 


The accumulated evidence indicates that rate of blinking eann 
accepted as a criterion for specifying intensities of light for visua wat 

Decrease in heart rate. Luckiesh (10) and Luckiesh and proce 
12, 14) cite data on change of heart rate while reading for ean eit 
under 1 foot candle and under 100 foot candles of light. No oF ee 
presented for intermediate levels of illumination. It is eo ae 
heart rate decreased 10 per cent under the 1 foot candle and 2p D 
under 100 foot candles. The conclusion was that from the verp e by 
ease of seeing the 100 foot candle level is desirable. An experimen 
McFarland, Knehr and Berens (17) was designed to check the fin as 
obtained in Luckiesh’s laboratory. The results led to the CONG eo 
that “It is questionable whether reliable criteria for debating erie 
quate levels of illumination for tasks such as reading during lle + 
ods of time (approximately 2 hours) can be obtained in terms pite 
heart rate . |.” Another check experiment was carried out PY ici 91 
man (1), who recorded heart rate while reading under 3 ano oe of 
foot candles of light. “The results do not support the conc u ae 
Luckiesh and Moss with respect to the value of heart rate asa must 
“of the ease of visual work.” In view of the above evidence jen for 
reject heart rate as a criterion for prescribing illumination intensi 
visual work. : nd 

Decrease in convergence reserve. Luckiesh and Moss (11, 12, ee 
Luckiesh (10) cite data on decrease in convergence reserve O ne 
muscles after reading for one hour under 1 and under 100 foot mi ta are 
light. The decrease was less under the 100 foot candles. No se know 
given for the range between 10 and 100 foot candles. We do 3 n such 
therefore, whether the 100 foot candles is significantly better tha 
levels as 20 or 30 foot candles. g d Moss 
Visual adaptation. Throughout their writings, Luckiesh ait levels 
(10, 11, 12, 14) emphasize that the eyes evolved under ba gl light 
of illumination and suggest the desirability of competing with i eyes 
by artificial means. They consistently ignore the fact that the © 


; À illumina 
readily adapt to easy and effective seeing over a wide range of illu 
tion intensities. 


Summary on intensity of illumination. Examination of the dan 
ployed by Luckiesh and Moss as a basis for specifying foot candle Its 25 
for visual work reveals a general lack of validity of these resu sion 
criteria for ease of seeing. The data from visual acuity, muscular te he 
and visibility measurements are misinterpreted or misapplied. f lack 
blink technique and rate of heart beat must be rejected because © 5 0 
of confirmation by independent workers, Furthermore the metho 
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statistical analysis employed are frequently at fault. Any science of 
seeing based upon such an unstable foundation must, therefore, lack 
validity. Since these data have been the justification for specifying 
what appear to be excessively high levels of illumination intensity, we 
must reject such specifications unless justified by valid evidence from 
new experimentation. 


LIGHTING CODES 


School lighting. The American Recommended Practice of School 
Lighting (35) specifies the following minimum foot candles in service: 
15 for classrooms, shops and offices; 25 for sewing and drafting rooms; 
and 30 for sight-saving classes. There is general agreement on the im- 
portance of hygienic illumination in reading and study situations. The 
recommended foot candle levels seem satisfactory in view of research 
findings other than those cited in the code. There should be, of course, 
a sound experimental basis for recommendations of this kind. Tinker 
(23) has pointed out that the recommended practice for school lighting 
is based upon conclusions derived from misinterpreted experimental 
results, Fortunately, the recommended practice is adequate in spite of 
inferences from inadequate data. 

In a later publication by Sturrock (21), the foot candle levels are 
not in an approved code but are listed as the levels found desirable in 
the experience of successful business institutions, i.e., good present-day 
practice, For schools the foot candles listed include: 30 for study halls, 
class rooms, general laboratories, general manual training; 50 for draw- 
ing room, close work in laboratory, sight saving classes; 100 (considered 
especially low) for close work in manual training, and in sewing rooms. 
It is obvious to the impartial person who knows the field that these 
Suggestions represent more intense illumination than is necessary for 
adequate seeing in the school situation. Data summarized by Tinker 
(24) and additional experimental evidence (25, 27) indicate that about 
15 foot candles are adequate for ordinary schoolroom tasks and that 
25 to 30 foot candles are satisfactory for the more severe tasks. Justifi- 
Cation for the higher intensities is sought in the discussions of Luckiesh 
and Moss (12, 14) and’ Luckiesh (10). These have been evaluated above. 
_ Office lighting. The Recommended Practice of Office Lighting (36) 
includes the following foot candle levels: 50 for difficult seeing tasks 
Such as accounting, bookkeeping, and drafting; 25 for ordinary seeing 
tasks such as general office work, private office work, mail rooms; 10 
for casual seeing tasks such as reception rooms and washrooms; 5 for 
Simple seeing tasks such as halls and stairways. Considering the se- 
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verity of the tasks performed by some workers in general offices and 
special (as accounting) offices, the above recommendations are satis- 
factory. The 50 foot candles, however, should be considered liberal even 
for the difficult seeing tasks. The statement that “higher values will 
contribute greatly to accuracy, speed and ease” cannot be accepted as 
valid. 

Sturrock’s (21) sumniary of good present day practice does not devi- 
ate markedly from the recommended practice except that typing and 
prolonged reading of shorthand notes are listed at 50 foot candles and 
intermittent reading and writing at 30 foot candles. Each of these 4 
about twice what is needed in terms of the visual task. The basis for 
the higher intensities is in terms of the discussions of Luckiesh an 
Moss (12, 14). The inadequacy of these data has been pointed out 
above. á 

Industrial lighting. A wide range of illumination intensities 18 
recommended for various tasks in industry (37). Among the higher foot 
candle recommendations are: over 100 foot candles for such operations 
as extra fine assembly, automobile finishing and inspecting, cutting an 
sewing dark goods, engraving, proofreading, final inspection of ug 
casings, grading and sorting tobacco products, and certain inspection 
work in textiles; 50 to 100 foot candles for such operations as automobile 
assembly line, glass works inspection, fine inspection, bookkeeping, font 
assembly-sorting in printing industry, tin plate inspection, and stitching 
dark leather. With regard to all the recommendations, one is cautione i 
that the foot candles are minimum operating values and that in almost 
every instance higher values may be used with greater benefit. f 

It is stated that the recommendations are taken from a series 0 
studies on the illumination needs of specific industries, or, if not avail- 
able there, from current good practice. Examination of these studies 
(listed on page 23 of the report) indicates that in the main they are 
surveys rather than experiments. Furthermore, there is a lack of ade- 
quate descriptions of the survey techniques employed. In a few ur 
stances a general description of methods was given. Apparently what 
happened was first to make a survey of practice. This was followed bY 
some sort of job analysis to determine what had to be discriminate": 
Then by reference to research studies (such as those reported by Luck- 
iesh and Moss in their books) the intensity level of illumination pe 
sumably needed for the specific job was deduced. This method has 
some virtue provided sound data are referred to, which was not done i 
these cases. In a few instances it is stated that visibility measurements 
were made. Occasionally installations to achieve the recommendations 
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were made, the effect observed and additional modifications made. In 
no case was there experimental determination of the light intensity 
needed. 

There are no valid experimental data which indicate that more than 

50 foot candles are needed even for those practical visual tasks which 
approach threshold discrimination. Furthermore, as pointed out by 
Harrison (8), visual comfort may decrease under high intensities. 

„` Home lighting. The most recent recommended practice for home 
lighting (38) specifies intensities ranging from 10 foot candles on card 
tables to 100 and more for sewing on dark goods. Forty foot candles are 
recommended for such situations as children’s study table, kitchen work 
counter, laundry, and for prolonged reading. There is no, valid reason 
for going above 25 to 30 foot candles for the more severe visual tasks in 
the home (24). Approximately 15 foot candles is adequate for many 
of these visual tasks. Figure 1 in Recommended Practice of Home Light- 
ing (38) is misleading. “This chart shows the extent to which occupa- 
tions and poor seeing conditions leave their mark on eyesight.” The 
implication is that poor illumination causes ocular disability. There are 
no valid data which indicate this to be so. This chart represents an un- 
justified form of propaganda. 

Present-day practice. Sturrock (21) has assembled foot candle levels 
of illumination which are labeled ‘good present-day practice.” The 
tables are preceded by a classification (after Luckiesh and Moss) of foot 
candle needs for visual discrimination of tasks varying in difficulty. The 
Material is apparently designed as a guide but is not necessarily in the 
form of recommendations. This sort of thing is valuable in many ways. 

But since it is based to a considerable degree upon the material pre- 
Sented by Luckiesh and Moss (12, 14) and by Luckiesh (10), the illumi- 
high in some instances—as 100 foot 
and 50 foot candles for reading 
t should be pointed out, however, 


Nation intensities are excessively 
candles for sewing and proofreading, 
Small type and for kitchen counters. I 
that much of the material is fairly satisfactory. 

In general, recommended practice prior to 1940 (35) is fairly ade- 
quate, but as new codes are issued at later dates the apparent tendency 
has been to recommend as intense lighting as the traffic will bear. This 
Is justified by referring to the work of engineers (largely Luckiesh and 

oss) who state that these high intensities are nevertheless inadequate 
Or easy seeing. As pointed out above, both the experiments and the 
Conclusions which are cited as fundamental are frequently invalid. Fur- 
thermore the data are out of line with other independent experimental 


results, 
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VisuaL Factors 


Eye disabilities. It is generally accepted that eyes with disabilities, 
even when corrected by glasses, need brighter light than normal eyes for 
adequate visual discrimination. Ferree and Rand (6) and Ferree, Rand 
and Lewis (7) are usually cited as supporting evidence. In the first 
study (6), it was found that apparent diopters of accommodation 1n- 
creased more for 14 presbyopes than for normal eyes in going from 1go 
5 to 25 foot candles of light. Interpolation indicates that for the normal 
eyes the curve of improvement shows little rise after about 8-10 foot 
candles; for the presbyopes, after about 15 foot candles. In addition, 
one myope and one presbyope were compared with a normal subject by 
measuring apparent diopters of accommodation at 13 intensities from 
0.5 to 100 foot candles. The curve of efficiency for the normal person 
improved rapidly to 5 foot candles, then more slowly to about 20 an 
very gradually thereafter; for the myope there was considerable 1™- 
provement to about 20 foot candles and little thereafter; for the pres- 
byope there was considerable improvement to about 38 foot candles 
and then slower improvement to 100 foot candles. It is of course 1m 
possible to generalize from one case, but apparently those 
abilities need somewhat brighter light than normals for 
This does not mean that they need 100 foot candles or more, as 
people wish to imply. 

In the other study (7) Ferree, Rand and Lewis were concer 3 
distant (20 feet) vision. The visual acuity for 4 presbyopes was ee 
pared with acuity for 3 normal people. The presbyopes continue = 
gain in visual acuity from 25 to 100 foot candles while the normal o% 
made little pain within this range. Since there: ia fittle or NOTRE” i 
between acuity of distant vision and acuity at near vision, these rest k 
have no bearing upon visual discrimination at the work surface ye i 
work bench, etc.). Furthermore, oné should not prescribe illuminat! a 
for suprathreshold tasks in terms of threshold measurements (VIET 
acuity). There is no evidence from these studies which implies A 
excessively high foot candles are necessary for those with ordinary ke h 
disabilities. Rather, they suggest a moderate increase for those W' 
corrected vision as compared with normal eyes. „a adapt 

Visual adaptation. It is well established that the eyes readily 4 nce 
to easy and effective seeing over a wide range of illumination intens! ina” 
This adaptation is rather slow in going from bright to dimmer ome 
tion (for practical purposes, 15-20 minutes) and rapid in gong ated 
dim to bright illumination (1-3 minutes). Tinker (25) has demonstt 
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that when adaptation is incomplete on shifting to a lower level of il- 
lumination, speed of perception is retarded. When adaptation is ade- 
quate, however, visual perception in reading is fully effective from 3 
foot candles up for normal eyes in reading legible print. In another 
study, Tinker (26) showed that subjects tend to prefer for reading ap- 
proximately the illumination intensity to which they have been adapted, 
whether it be 8 or 52 foot candles. These data indicate that readers 
tend to consider comfortable for easy reading any one of a wide range 
of illumination intensities provided such intensities are above critical 
levels and provided visual adaptation is adequate. Codes of lighting 
have consistently ignored the role of visual adaptation in seeing. They 
carefully point out that the eye has evolved under the bright illumina- 
tion of daylight, but do not mention that the eye also evolved to see 
adequately at low as well as at high intensities of light. 


ILLUMINATION FOR ADEQUATE SEEING 


Critical levels of illumination. The critical level of illumination is the 
intensity beyond which there is no further increase in efficiency of per- 
formance as the foot candles become greater. Tinker (24) has sum- 
marized the data for critical levels of illumination: for reading of legible 
print (about 10 point on good paper) by adults, it is approximately 3 
to 4 foot candles; for reading and study of children, 4 to 6 foot candles; 
for arithmetical computations, less than 9.6 foot candles; for sorting 
mail, 8 to 10 foot candles; for the exacting task of setting six-point type 
by hand, 20-22 foot candles; and for very fine discrimination required 
to thread a needle, 30 foot candles. In a later study, Tinker (27) found 
the critical level of illumination for reading newspaper print to be about 
7 foot candles. Employing intensities from 2 to 55 foot candles, Rose 
and Rostas (20) found that reading efficiency, in terms of speed and 
comprehension, did not increase by a measurable amount with increased 
intensity of illumination. 

Adequate levels of illumination. It is obvious that visual work should 
not be done at critical levels of illumination. There should be an ade- 
quate margin of safety to provide for individual variation and the like. 
For such visual tasks as reading good-sized print (10 to 11 point) ona 
good quality paper, i.e., print of good legibility, 10 to 15 foot candles 
Should provide hygienic conditions when one’s eyes are normal. For 
Situations comparable to the reading of newsprint, 15 to 20 foot candles 
Should be adequate. In situations involving the reading of handwriting 
and other comparable tasks, 20 to 30 foot candles seem desirable. For 
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aie comparable to discrimination of 6 point type, there should be 30 
to 40 foot candles. And for the most severe tasks encountered in work- 
day situations, 40 to 50 foot candles will be found adequate. There ip 
no valid experimental evidence now-available that indicates a need for 
over 50 foot candles intensity for adequate visual discrimination. The 
intensity values from 10 to 20 should be increased somewhat (5 to 10 
foot candles) for eyes with slight disabilities or for those with on 
tions. For the higher values, however, no practical gain will be rosea 
for these people by increasing the intensity. The above suggestions ho h 
for school children as well as for adults. In general, the child has muc 
less severe visual tasks than adults. ae 

Intensity of illumination cannot be prescribed without coordinating 
it with other factors such as distribution of light and brightness contrast. 
A good example of the uselessness of excessively bright light is found in 
the study by Darley and Ickis (4). They were concerned with vision 
in the drafting room, a very severe visual task. In comparing 30 re 
75 foot candles of indirect light, they found the efficiency ratings a 
the two to be only slightly different. When they compared 40 with 
foot candles of direct light (troffer) under conditions of no cele 
glare, they also found no significant differences in the efficiency pene 
The observations of Harrison (8) are relevant here. He points out. a5 
danger of glare with installations of 50 foot candles and above of arti- 
ficial illumination. 


SUMMARY 


Examination of the literature upon which lighting recommendath 
are based reveals that some techniques of experimentation are inva a 
and that interpretations from certain other data are u nwarranted. ee 
of the recommendations are adequate, others are not. The trend see 4 
to be to specify as high intensities as the traffic will bear and at the eo 
time to suggest to the consumer that if he uses still higher intense 
he will improve his ease of seeing. All will agree that there should 9 
sufficient light for adequate seeing. It is high time, however, that ut 
consumer know what is adequate and what is surplus. As pointed 0 
by Winslow (34), illumination should conform to real human needs. 
is human health and comfort which are at stake. ] 

In general the recommended practice concerning distributio 
light, brightness contrast and color of light is satisfactory. 
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ON FESTINGER’S EVALUATION OF SCALE ANALYSIS 


LOUIS GUTTMAN 
Department of Sociology and Anthropology, Cornell University 


: The theory of scale analysis had its origin some seven years ago. 
Since that time, especially by virtue of extensive and intensive research 
done in the Army, some of its further ramifications have been explored 
and several techniques have been devised for carrying an analysis out 
in practice. The power and incisiveness of this approach have been 
demonstrated in numerous attitude and opinion surveys made in the 
past several years, as well as in studies of achievement tests. A pleasing 
feature has been the simplicity of the techniques involved. 

Most of the material, with respect to both applications and theoreti- 
cal developments, is as yet unpublished. A manuscript has been pre- 
pared by Edward A. Suchman and the writer which will give the first 
comprehensive statement of both the theory and practice of scale analy- 
sis, This manuscript will form part of the four volumes soon to be pub- 
lished by the Social Science Research Council on the work of the 
Research Branch, Information and Education Division of the War De- 
partment, These volumes will also provide many illustrations of how 
scale analysis has been used for practical problems. Meanwhile, some 
brief statements of the principal concepts and instructions for practical 
procedures are available in article form to those who wish to use this 
approach in their own research (see the bibliography below). 

3 On the basis of some articles which have been published and of some 
mimeographed progress reports, Festinger (1) has recently attempted a 
survey and evaluation of scale analysis. Since his survey is not based 
on all the information available, it is admittedly tentative and incom- 
plete. In addition, full advantage has not been taken of the material 
which Festinger used as his sources; he raises a number of points which 
have already been answered there, and also introduces erroneous inter- 
Pretations and conclusions. 

It seems worthwhile to discuss at the present time some of Festing- 
er’s criticisms in order to help clarify the issues and to correct some 
Important misapprehensions. Attention is also called to some articles 
that have appeared since Festinger prepared his paper, discussing vari- 
ous aspects of scale analysis (10, 11, 13). 

Three of Festinger’s points will be analyzed here: (a) criteria for 
Scalability, (b) techniques of analysis, and (c) the use of scale analysis 
in practice. In the course of the discussion, some other aspects will be 


brought out which Festinger has not considered. 
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CRITERIA FOR SCALABILITY 


Reproducibility. The main purpose of scale analysis is to test the 
hypothesis that a universe of qualitative items can be represented bya 
quantitative variable. In order for the universe to be represented eg 
actly by a quantitative variable, each item must be a perfect function 
of that variable, or be perfectly reproducible from it. Thus the concept 
of reproducibility is paramount in scale analysis. 

In practice, only a sample of items is used from the universe of a. 
tent. Furthermore, in practice, it is not expected to find perfectly #4 
producible or scalable: universes, Among other things, perfect eee 
ducibility implies perfect test-retest reliability, which is certainly no 
to be expected empirically. However, if the reproducibility of the entire 
universe is very high, say over 90%, then that may be sufficient for 
many practical purposes. A quantitative variable which will represent 
an indefinitely large universe of items that well will ordinarily not Er 
much predictive power, whether used for predicting outside variab fe 
or whether predicted from outside variables. This will especially be tru 
if the errors of reproducibility are random. , nly 

Since universe reproducibility must be estimated on the basis of is 
a sample of items, it becomes evident that the sample's reproducibi ee 
alone may not be a sufficient guide. Festinger criticizes the sample T ; 
producibility coefficient for its inadequacy.* This inadequacy was a 
nized at the outset in scale analysis. The same kind of examples A 
Festinger uses (1, pp. 156-157), showing how five or nine statistica ed 
independent items can have high reproducibility, were worked out cae 
ously; several such examples will appear in the forthcoming volun ‘ 
Indeed, there is an even worse case than that of statistical independen®™ 
namely that wherein some items have negative relationships with aces 
this is worse than being statistically independent from the point of V! in 
of scale analysis. Examples can be constructed showing how ee 
this case it is possible to have supriously high reproducibility in a sM 
sample of items. spifity 

Festinger omits to point out that this problem about reproducibi K 
was raised before, and that several answers have already come fort): 


A re 
In one of my mimeographed reports to which Festinger refers (15), the 
is the following question and answer: 


O. is reproducibility by itself a sufficient test of scalability? tures 
A. No. It is the principal test, but there are at least three other fea 


; PRE, 5 jzance 
* Hausknecht (12) has raised this Criticism earlier, also without taking cog”! 


of the fact that other criteria have always been used as discussed below. 
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that should be taken into account: (a) range of marginals, (b) random scatter of 
errors, (c) number of items in the sample. 


Further questions and answers elaborate on the point. And again, in 
another paper (7) to which Festinger refers, it is stated: 

The percent reproducibility alone is not sufficient to lead to the conclusion 
that the universe of content is scalable. The frequency of responses to each 
Separate item must also be taken into account for a very simple reason. _Re- 
Producibility can be artificially high simply because one category in each item 
has a very high frequency. It can be proved that the reproducibility of an 
item can never be less than the largest frequency of its category, regardless of 
whether the area is scalable or not. ' 


And further: 

An empirical rule for judging the spuriousness of scale reproducibility has 
been adopted to be the following: No category should have more error in it 
than non-error. 


If this latter rule alone were applied to Festinger’s examples, it would 
immediately reject the hypothesis that the items are from scalable uni- 
verses. The consideration about pattern of error would also disqualify 
the hypothesis that the items were from scalable universes. 

An Alternative. One contribution to spuriously high reproducibility 
is the fact that each item is being related to a score which is based in 
Part on the item, An alternative way to compute the coefficient of re- 
Producibility is to hold out each item in turn from the analysis, thus 
obtaining N sets of trial scale scores. The errors for each item can then 
be counted from its relationship to the score based on the N—1 other 
items, ; 

_ If this partial-score method were used on statistically independent 
items, then the reproducibility for each item would be precisely the 
Telative frequency of its modal category. Thus, in Festinger’s example 
(1, Pp. 156) of five independent dichotomies with marginals 80%, 60%, 
50%, 40% , and 20% the respective modal relative frequencies are 80%, 
60%, 50%, 60%, and 80%; hence, the reproducibility of all five items, 
Computed from partial scores, would be the mean of the latter five per- 
Centages or 66%, compared with the spurious 86% Festinger obtained 
‘rom whole scores. Indeed—no matter what the interrelations of the five 
tems were—their reproducibility could not be less than 66%, because 
reproducibility of an item can never be less than its modal frequency. 
Similarly, in Festinger’s second example (1, p. 157) of nine statistically 
independent dichotomies with marginals .9, .8, 17, 26, «55-4, Siegen 
Tespectiye modal proportions of the items are .9, .8, .7, .6, .5, .6, «7, -8, 
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<E inger , 
and .9, so the reproducibility of the set cannot be less -p ape 5 a. ; 
finds .83 reproducibility from whole scores whereas i par 
used the obtained reproducibility would be .72. ae ia 

Items with extreme marginals like .9 and .1 do not he nara 10% 
testing reproducibility since such items can never have more t 
aa practice, it does not usually seem worthwhile to bother w Poa 
tial scores, although this technique is available for doubtful o io be 
fictitious examples of independent items do not illustrate d gene 
expected in practice. Attitude (or achievement) items of the > E 
eral content are usually sufficiently correlated so that scores ee, i 
eleven of them will not be substantially different from scores basea 
twelve. Reproducibility from whole scores will not be maaa over 
than from part scores—so their spurious excess of reproducibi oe pate 
that from part scores can be largely ignored, Furthermore, pet ae | 
score reproducibility is not a sufficient test of scalability, for 
ditional criteria mentioned above must also be considered. _ ity when 
There is room for more improvement on criteria for scalabili Y aval 
samples of content are used, but it should be made clear that pees 
bility by itself has not and is not the sole basis for drawing en ibility | 
from a sample of items. It is the basic one, because the mee riteria 
of the universe is essentially what is in question, but additional ¢ 
have been and are being used. „pected 
Reliability. The suggestion that Festinger makes that the sE of 2 
occurrence of scale responses be calculated under the assumptio ene 
perfect scale plus a certain degree of unreliability is a promus all 
This idea had been thought of in the earlier stages of the develop 


nod, THE 
of scale analysis but discarded in the form Festinger has suggested ted Ñ 
proportion of people with no scale errors cannot be properly ca : 
by the method that F 


š i 4 
estinger uses, Apparently he assumes that tie | 
the proportion of population responses that will be in the scale hae the | 
for one question, then the proportion that will be jointly wit 1 the | 
scale pattern for seven questions is (.9)7 or 47.8%. Unfortunate a 
same reasoning would say that the proportion of people who ia the 
responses outside the scale pattern should be (.1)7; and in gener 


A hou 
proportion of people with X scale responses and 7— X scale errors $ 
be given by the binomial distribution 


7! 
K = moore. 


ki 
— 


r m scale 
But this is impossible, for nobody can have all his responses as 


yf 


i 
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errors. Indeed, for the empirical example that Festinger borrowed (1, 
Fig. 2, p. 157), no matter what pattern of response a person may have, 
he can be placed into one of the scale patterns with at most four errors. 
Therefore, the range of possible errors for each person is 0 through 4, 
rather than 0 through 7 as Festinger supposes. This means that Fes- 
tinger’s calculations cannot be carried out consistently to estimate re- 
producibility under the given assumption. The difficulty is that whether 
a person will fall into the scale pattern is not independent of whether 
another of his responses is within the scale pattern. Unreliability does 
not behave that way with respect to the scale pattern. 

The actual reproducibility of this example of seven questions was 
about .85 rather than the .9 Festinger assumed. It is interesting to note 
that (.85)7 is .32, which is not far from the “over one-fourth” perfect 
Scale types reported. Actually, the universe sampled by these seven 
questions would not now be accepted as sufficiently scalable but would 
be broken up into sub-universes; the study was made when 85% re- 
Producibility was the empirical rule rather than the present 90%. The 
Study did serve its purpose well, however, as collateral evidence pre- 
sented there showed. 

The further calculation that Festinger makes of adding 3.7% to 
his 47.8% scems based on an unfortunate double usage of the word 
“chance.” In his second paragraph on p. 158 (1), “chance” is used to 
mean statistical independence between items. Such independence can- 
not exist simultaneously with the assumption of a scale pattern in his 
following paragraph; that is, the 7% who fall into perfect types under 
the hypothesis of independence of items have nothing to do with the 
distribution of error under the assumption of uni-dimensionality plus 
unreliability. The binomial distribution by itself—if it were correct— 
takes care of the second situation. Hence, Festinger’s calculations are 
incompatible in adding 3.7% (7% of 1—.478) to 47.8% to obtain 52.2% 
as the “chance” proportion. The 7% is correct for independent items; 
the 47.8% would be correct for the scale-plus unreliability case if the 
3 inomial hypothesis held; and the two cases do not hold simultaneously. 
‘Chance” means something different in each case. 

A consistent use of reliability. Several correct approaches to the 
use of the concept of unreliability are possible, instead of the incon- 
Sistent binomial approach. One such approach will be sketched here 
briefly for the case of dichotomies. Let # be the number of dichotomies 
1 the sample of items so that there are n+1 scale types or ranks pos- 
sible, Let + be the rank of the type that is “positive” on r of the items; 
f ranges from 0 to x. Let Pr be the proportion of the population whose 
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“true” rank on the x items is r, and let Prj be the probability a person 
of “true” rank r will be “positive” in the jth item (G=1,2,.--5 n): 
There are 2” types of people—scale and non-scale—possible on E 
dichotomies. The expected proportion in each of the 2” types can be 
calculated from the Pr and Prj. Conversely, from the observed 2” pro- 
portions in an actual experiment, the Py and Prj can be estimated. Thi 
are n+1 parameters Pr, of which n are independent since their sun 
must be unity. There are (n+1)n parameters Prj, all of which are inde- 
pendent. Hence, there are x+(n+-1)z, or n(n-+2), independent Sage 
eters to be estimated from 2"—{ independent observations. If 7 
greater than 5, this provides more equations than there are va F 
so the hypothesis of the scale structure can be tested, as well as having ae 
parameters estimated. Unfortunately, the equations involved in PA 
above analysis are curvilinear, and do not seem to lend themselves 3 
practical use because of the difficulties in the numerical compu an a 
Furthermore, even this analysis has been simplified by assuming t i 
persons within the same “true” rank were equally reliable within ae 
item. Without this simplifying assumption, the equations would ha 
innumerably more*parameters. E E cji 
In any analysis using the concept of test-retest reliability, it E 
be remembered that scalable data must in general be highly reliable, y 
though the converse is not necessarily true. The coefficient of repro 
bility—especially if computed by the part-score technique descri 6 
above—sets a lower bound to the average reliabilities of the ten 
and especially 8). In particular, if items are perfectly reproducible, t ae 
are perfectly reliable. Hence, Festinger errs in his assertion that Ev a 
if a perfect scale were achieved these claims [concerning invena 
properties] would all be limited by the degree of reliability . . . of ms, 
questions asked” (1, p. 160). Perfectly scalable data are perforce Piel 
fectly reliable. Conversely, highly unreliable data cannot be scala ily 
One of the contributions of a scale analysis is to provide automatica 


: . hea ies a . ch 
information about reliability by helping set a lower bound to it for ea 
item. 


The simple criteria used in con 


P A ibili 
junction with that of reproducibility 
for samp 


le data do serve to distinguish between data that are high? 
scalable and those that are not. The case where the items are in E 
pendent will always be rejected on the basis merely of the criterion E 
improvement, namely, that no category should have more errors ua 

non-errors. The further criteria of studying patterns of error also teni 
to insure that no dominant second variable is present even if reproduc 
bility is high. That is what is meant by the statement that “in imperfe¢ 


~o 
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scales, scale analysis picks out deviants or non-scale types for case 
studies.” If no non-scale types have substantial frequencies, then that 
tends to indicate that there is no substantial second factor present. 
However, if one or more non-scale types do have a substantial frequency, 
then that is an indication of where an additional factor (or factors) is 
entering into the picture. If an additional factor is sufficiently promi- 
nent, it may be worthwhile to try to piece it out further by asking ad- 
ditional questions. The universe might be divided into two or more sub- 
universes, each of which may be scalable separately. Or it may turn 
out that the additional factor is so highly correlated with the most domi- 
nant factor that it does not make much difference whether they are 
treated as two separate variables or as a single variable. 

The problem is not to find out whether a perfect scale is present in 
Practice, but rather whether it is worth worrying about any additional 
variables that may be present. The criteria used in practice are believed 
to provide an answer to this and to decide properly whether or not a set 
of data can be regarded as sufficiently scalable for most practical pur- 
Poses, 

Quasi-scales. One kind o! 
A quasi-scale is different from a sca 
but in the entire pattern of responses. 


stood the definition of a quasi-scale, 
fers from a scale only with respect to reproducibility (1, p. 156 and p. 


159). A universe which is quasi-scalable will ordinarily have less than 
85% or so reproducibility, but that is not its distinguishing feature. The 
distinguishing feature is the gradient in the responses to the items. Cutting 
Points cannot be established (as in the case of a scale) which will enable 
ne to say that a person above the point is in one category of an item 
and a person below the point is in another category; but one can state 
that, if one person is higher than another in the quasi-scale, then his 
Probability of being in a higher category of an item is correspondingly 
&reater, 
There are many kinds of configuration which are less than 85% or 
90% reproducible and which are not quasi-scales at all. For example, 
an area may have two or more dominant factors in it, in which case it 
Would not be either a scale or a quasi-scale. In a quasi-scale, there are 
ne dominant factor and infinitely many small factors. The order of 
People in a quasi-scale is according to the dominant factor, and is es- 
Sentially invariant from sample of items to sample of items, provided 
that the samples are large enough. There is a great deal of work yet to 
© done on the theory of quasi-scales, but enough is known to say that 


f non-scalable universe is dalled a quasi-scale. 
le, not just in the reproducibility, 
Festinger seems to have misunder- 
for he seems to believe that it dif- 
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zinds 
they have quite a different character from scales and on ae i, 
of universes. Another distinguishing feature esa ee ate 
uasi-scale is that the scale has an intensity function and fu tent 
ineful components, whereas a yore not have a 
i further components of that kind. : Es 
oe phenomena have been found to be a a 
ample, the Neuro-Psychiatric Screening Adjunct, y hic Posh ee 1A 
paper and pencil test used at all military stations n oa A 
is a quasi-scale and is a product of a rigorous investiga io Ee 17). 
screening tests made possible by the scale analysis approa' 


TECHNIQUES FOR SCALE ANALYSIS 


; avail- 
Scalogram devices. There are several alternative schemes ee pii 
able by which to carry out a scale analysis in practice. paaa ome 
ally equivalent in terms of the results they yield, but they Hl al device 
what in operation, Scalogram boards have been the ee Atl ai 
used by the War Department, and are perhaps the most : A operates 
easiest to use. The boards are relatively simple to make and fiat or not 
the cost depends upon how largea board is desired and whe then the 
a pair is to be made. If a single board is used instead of come faicly 
workmanship need not be precise and the board can be ma E 
cheaply by any carpenter. There are alternative mechamra m Ka 
that might be used instead of the wooden board, and un n easier 
other schemes will be invented in the future which will be eve logram 
to construct. Instructions for the construction and use of a sca 


. the Re- 
board will appear in the forthcoming volumes on the work of 
search Branch. 


The Cornell technique (7 
a course on attitude 
no background what 
all items are dichot 
Cornell technique is 
there is no problem 


in 
‘ st ig taught i 

) is also very easy to learn; it is t ho have 
where 
he 


and public opinion analysis to students y 
soever in statistics. For achievement tests, g—t 
omous—being marked either right or U 
perhaps the best of all to be used. For giera a one 
of combination of categories, so that there 1S pit in 
trial to be made in an analysis. The Cornell technique suffers È has 
flexibility compared to the scalogram board when a series of pes 
to be made. Ordinarily, but two trials may be needed ais oh cases 
and the Cornell technique has proved very advantageous in suc! ment 
for general research purposes, It can be carried out on IBM equip 
as well as by hand. «++ tabulation 

The Goodenough technique (2) is based upon an explicit ta hae 
of all combinations of responses that actually occur. It is more 
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ous” than the preceding two techniques in that it counts the errors at 
each stage. However, it yields no different results in the end. Appar- 
ently Festinger has not worked through the Goodenough technique to 
see how it does work out in practice.* The first step seems simple, but 
it takes a good deal of experience to master the three following steps. 
The process becomes very bulky and involved when ten or twelve items 
are used. 

The Cornell Technique has the advantage that its complexity does 
not at all change, regardless of the number of items (though of course 
the amount of labor increases with the number of items). The same 
lack of increase of complexity holds to a slightly less degree with the 
Scalogram board. 

The problem of metric. The earliest technique for scale analysis was 
that of least squares (3). It is quite properly to be abandoned as a 
procedure in practice because it is certainly far more cumbersome than 
the others. However, the equations involved have turned out to be of 
basic importance in interpreting a scale, and have led in particular to the 
empirical treatment of the intensity function which is proving so vital 
for attitude and public opinion work. Also, the basic thinking behind 
the equations have led to a solution to the related problem of paired- 
comparisons (12). 

In the beginning of my work on scale analysis, I had thought that 
one of the most important problems was that of metric. I had thought 
that how to obtain weights for items was perhaps the leading problem 
to be solved. But as the theory of scale analysis developed, it became 
clear that the problem of weights was essentially a minor one for most 
Practical purposes. Indeed, for the perfect scale pattern, it is easy to see 
that if scores are to be obtained for people by adding up weights assigned 
to categories of items, then, no matter what weights are used—as long 
as they have the proper rank order within each item—the scores of the 
People will have exactly the same rank order. The ordering of people in 
this sense does not depend at all upon finding a particular weighting 


System. 

The important problem tur 
a universe of items must have i 
finding weights. 


ned out to be that of finding the structure 
n order to be scalable; it was not that of 


isread Goodenough as to how to measure reproduc- 

hat “at least 85% of the total number of responses 

Must fall within the scale pattern, SO that it is possible to reproduce 85% correctly all the 

Tesponses of all the respondents from the scale scores” (2, p. 184). Festinger seems to 
ave misread this to mean that 85% of the individuals fall into perfect scale types. 


u k Festinger also apparently has m 
ibility. Goodenough explicitly says t 
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The problem of a metric does turn out to come into the picture for 
further problems, and it first appeared as a practical problem with re- 
spect to that of bias in questionnaire wording (11, 13, 14). The problem 
here was, after people are ranked from a high to a low on an attitude 
or opinion, to find a dividing point in the order such that the people on 
one side can be called positive and people on the other side can be called 
negative. The equations of scale analysis, when applied to the perfect 
scale pattern, show a most remarkable result. They show that a univer se 
of items which is perfectly scalable can be resolved into an infinite series 
of principal components, the first of which provides the basic metric, 
the second of which is the intensity component, and the remaining one 
are as yet not named (10). Empirical study of the intensity function 
has afforded for the first time a scientific solution to the problem of 
question bias. 

These equations, then, show that a scalable attitude is somewhat 
different from the twelve-inch ruler that Festinger uses as an analogy 
(1, p. 160). The responses of a person to items in a scalable universe 2° 
seen by means of these equations to be a function of the person’s metric 
score, his intensity, and the further components in the scale. The per 
son’s rank order is sufficient to reproduce his responses exactly; in this 
euse, the responses of the population are but a function of a single 
variable. Resolving the responses into components by the alternative 
device of the least squares equations shows the responses to be a functio? 


A SY many variables, each of which is a function of the ran 
order. 


A These striking results from using the least squares equations in con- 
junction with the perfect scale pattern will be elaborated on in the forth- 
coming publication on the work of the Research Branch. It mig 
further be pointed out here that these equations resolve also the paradox 
which appears in achievement tests where the difficulty of an item see 

to introduce a factor different from the common content factor that the 
items may have. Since scale analysis applies to achievement tests 3° 
z as to attitude or opinion areas, achievement tests also are resolvable 
into the principal components of a scale. In a scalable achieveme? 
test, then, each item is a function of but a single dimension from tHe 
point of view of reproducibility, but a function of infinitely many oly 
mensions from the point of view of principal components. The appare” 
contradiction between these two points of view is resolved by the a 


that the infinitely many principal components in turn are perfect func- 
tions of the rank order of people. 
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4 Incidence of scales. The theory and techniques of scale analysis pro- 
vide a test of the hypothesis that a universe of qualitative items can be 
represented by a single quantitative variable. This hypothesis is ap- 
propriate for any qualitative universe obtained by any method of ob- 
servation: The universe may be a set of items recorded on a question- 
naire, or observations obtained in non-directive interviews, by partici- 
pant observation, or by any other technique of gathering data. No 
matter how the data are gathered, each observation is but a sample of 
all similar observations that could have been obtained, and the entire 
universe of observations is ordinarily of interest. 

As Festinger suggests, scalable universes may be the exception rather 
than the rule. Festinger does not give any explicit reasons for his belief, 
but this position will be substantiated in the forthcoming volume. It 
has already been pointed out that one possible reason for the existence 
of an attitude scale is that of a homogeneous culture (4, p. 149). Ifa 
Population is not subjected to the same social stimuli with respect to the 
attitude, it might be expected that it will prove to be unscalable for 
them. The fact that neurotic phenomena have not been found scalable 
can perhaps be explained in this fashion. Similarly, an area of achieve- 
ment may be expected not to 


be scalable if there is no uniform program _ 
of training for the population involved. 


Another reason for expecting many universes not to be scalable in 
Practice is that the notion of a universe is so comprehensive. Each 
sub-universe of a universe is of course itself a universe. Since there is 
Ordinarily a vast number of imperfectly related sub-universes, there 
must be a vast number of combinations of them which are non-scalable 
universes. Merely this formal consideration would lead one to believe 
that most universes are not scalable. Non-scalable universes may of 
course be broken down in some cases into scalable sub-universes. One 
of the contributions of scale analysis is to point out the need for being 
clear about the universe’s content. By focusing on more and more 


Omogeneous content, research can be made more meaningful and ex- 
e effective in the long run. 


ternal predictions be made mor 
The development of the above-mentioned screening test for psycho- 


Neurotics (16, 17) is but one example of how research utilizing scale 
analysis was more effective than it would have been had the more tra- 
ditional but less incisive procedures been followed. Instead of throwing 


together all kinds of conceivable predictive items into one composite, 
ntent were defined which might be re- 


fifteen different universes of co 
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lated to the criterion of psychoneuroticism. The structure of each of 
these universes was first analyzed separately. Because each was found 
to be either a scale or a quasi-scale, only a relatively few items from 
each were needed in order fully to utilize the predictive power of the 
universes. The multiple correlation of the criterion was then worked 
out on all fifteen predictors with the finding that one of the universes 
predicted as well as the best combination of the fifteen. This enabled 
the short but efficient screening test to be used with the knowledge that 
it retained the predictive power of innumerably many items in fifteen 
different universes. Such a complete usage of predictive power could 
not have been made without scale analysis. A 
From the practical point of view, another important feature here is 
the amount of labor saved by scalogram techniques in obtaining this 
maximum predictive power, compared to using more traditional tech- 
mques which are far more laborious and which would yield less effective 
predictions, ; 
The two problems, that of scalability and that of external prediction 
are distinct but related. By focusing on the scaling problem in its ow? 
right, more effective external predictions are thereby made possible. 
There are many areas which have been found to be scalable thus far, 
and therefore these areas can be handled economically by means o 
simple scale scores. Many areas have also been found not to be scala- 
ble; all such areas cannot be handled so simply. It is known how to treat 
quasi-scalable areas, and Lazarsfeld is now completing a theory of E 
latent dichotomy which also can be handled by means of a single quant! 
fication. How to utilize other kinds of non-scalable areas is still 
unsolved problem. The emphasis that scale analysis makes in this con 
nection is that unless the structure of the universe is known, it is 7° 
known how best to treat the universe for any particular purpose. 
Distinction between theory and techniques. The basic theor j 
scale analysis is not to be confused with particular techniques for carry 
ing out such an analysis in various kinds of situations. Festinger borders 
on confusing the two when he states that “ ‘scale analysis’ seems to be a 
inique for use with paper and pencil tests or other instanc® 
of measurement where the situation permits the inclusion of sever 
questions centering about the same topic” (1, p. 160). If a re 
problem is concerned with a universe of content, then that univers 
must be studied. That is what the theory calls for. How adequate 1$ u 
technique which Festinger implicitly advocates of studying only a S08 
item from the universe? 


i da er 
One of the important aspects of a universe of content is its structut™®’ 


y of 


y. 
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for example, is the universe scalable or does it have some other kind of 
structure? The theory of scale analysis tells what a scålable structure is, 
and the various properties possessed by such a structure. 

The practical problem is to obtain information about the structure 
from only a sample of items. It has already been indicated how an ade- 
quate sample of items can be chosen to test the hypothesis of scalability. 
Furthermore, the number of items to be used in a pretest must be dis- 
tinguished from the number of items to be used in a final study. One of 
the properties of a scalable universe is that only one or two items can 
be used in a final study for many purposes once their place in the uni- 
Verse is ascertained. The scalability of the universe must first be ana- 
lyzed, however, by a dozen or so items in a pretest. 

The statement that “most of those engaged in this type of research 
[public opinion] will probably find the inclusion of a series of questions 
which could be subjected to scale analysis not feasible from practical 
Considerations” (1, p. 159) does not accord with what is the actual prac- 
tice both in public opinion and in market research, as well as in general 
attitude research. It is because workers in these fields are concerned 
With a universe of content that they pretest various questions on the 
Same topic; it is a foolhardy pollster who bases conclusions on but a 
Single question. The use of the split-ballot is evidence of this concern 
with sampling of content. In-addition, ordinary polls often include 
Several questions on the same topic on the same ballot. The extreme 
Position taken by advocates of “open-ended interviewing” is to ask a 
whole series of questions of every respondent. And of course, conven- 
tional attitude surveys almost invariably use a substantial set of ques- 


tions for a given topic. $ p ; 
It is a misapprehension to believe that asking several questions 


on the same topic necessarily creates a problem of rapport. In one sur- 
vey made of a national cross-section by a leading public opinion polling 
agency, an area of content was defined and then sampled by four ques- 
tions, Some of the interviewers complained because of the great simi- 
larity of wording of questions. The questions were very similarly worded 
because the content concerned the size of the Navy and was very hard 
to discuss in different ways. But even under these adverse circumstan- 
Ces, the analysis was successful in showing that the area was scalable and 
that the zero point could be located properly by the intensity function. 
Even more questions in the same area had been used in the pretest in 
Tthaca on a cross-section of the population there, and interestingly 
enough there was no complaint either from the respondents or Laie 
Mterviewers, although the interviewers were no different from those 
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used in the national cross-section and had no knowledge whatsoever of 
what was involved in scale analysis. An area of apparently very similar 
questions is an exception rather than the rule. The example about 
desire for post-war schooling that Festinger has borrowed (1, p- 157) 
certainly provides no problem of rapport, and the general run of oe 
studied by public opinion polls do not present any particular problem 0 
rapport. Another large market research agency has tried scale analysis 
in a routine study and has found no difficulty whatsoever with it. Be- 
cause of its simplicity and its objective solution to the problem of bias; 
this agency plans to use this approach regularly. be 
It seems premature, then, to conclude that scale analysis cannot i 

carried out in practice in public opinion work. To the contrary, scale 
analysis is becoming more essential in this field because it affords for the 
first time a scientific solution to the basic problem of bias in public op!?" 
ion polls. This problem arises from the fact that a universe of contr 
being studied and any single question is but a sample of all poss! 
questions that could have been asked. How can one determine ee 
question does coincide with the zero point of the entire universe, Wea 
the point which divides those who are negative on the issue from t 
who are positive? j lem 

The intensity function provides a scientific solution to this prob 7 A 
(13). It provides both a definition and a technique for ascertaining © 
zero point for the population. Unless some such objective approach f 
the question of bias is used in public opinion polls, it cannot be certa 
how much credence to place on their reports. 7 rs 

By providing a solution to the problem of bias, scale analysis ei 
the way for asking questions in the manner which will best help e 
lish rapport with the respondent. The particular form of a ques an 
does not affect the results of scale analysis, so the research worker ne 
concentrate on obtaining the wording which will make the interview! e 
work go most smoothly, Thus scale analysis has a contribution to m 
toward increasing rapport in surveys rather than the contrary. BBH Ai 
hension that the opposite is true seems to be due to a misconception t* 
scale analysis Presupposes a particular way of asking questions. 

Tf Progress is to be made in the scientific study of attitudes, PUY z 
opinion, and achievement, it seems necessary to concentrate on t : 
problem of the structure of content. Techniques are not worth much F 
not guided by any thoery. The theory of scale analysis happens to os 
to simple and practical techniques. To compare these techniques W! i 
others, one would have to ask: what theory of structure guides am 
ternative techniques and how adequately is this theory served thereby 


plic 
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NOTE ON “A REVIEW OF LEADERSHIP STUDIES WITH 
PARTICULAR REFERENCE TO MILITARY PROBLEMS’ 


DONALD E. BAIER 
Personnel Research Section, A.G.O. 


The valuable report? with which this note is concerned “. .. sam- 


with problems of selecting leaders, particularly in the military field.” 4 
It is the purpose of this note to make available additional facts an 


comments which appear to bear on the following conclusions of the 
reviewer: 


1. “Progress has not been made in the development of criteria of leadership 
behavior... .” 


2. “Advances in methodology in this field are definitely not striking.” 


It is this writer’s belief that these conclusions, insofar as they are meant 
to apply to military leadership, are not entirely warranted. 

In two reports? published by the Medical Field Research Labora; 
tory, Camp Le Jeune, N. C., research on measurement of “leadership 
is reported. These studies indicate a substantial relationship (tetra 
choric 7 =.42) between superior officers’ reports of the combat perform- 
ance of Marine Corps officers graduated from the Corps Officer at 
didate School and the standing of these graduates among their pellon 
marines as indicated by a nomination procedure conducted during ai 
pre-officer training. The two sets of evaluations were completely 1n 
pendent. h 

An as yet unpublished follow-up study by the Personnel Resa 
Section, AGO, of West Point graduates after 18 months of duty 27 
Army officers also reveals a significant association (r =.51 for ee 
Officers) between inter-cadet ratings or leader-nominations and ge 
as an officer measured by the Officer Efficiency Report, WD, AGO For 


: : : re 
67. Here again there is basis for believing that the two measures & 
independent. 


eos 2 ily rep“ 
1 The opinions expressed herein are those of the author and do not necessarily 
resent the official view of the War Department. 


e to 
? JENKINS, WILLIAM O. A review of leadership studies with particular referenc 
military problems. Psychol. Bull., 1947, 44, 54-79. 


Field Research Laboratory Report No. 1, January 18, 1946 and No. 2, May 16, 
Camp Le Jeune, N. C. 
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The reviewer's account of the research upon which are based the cur- 
rent methods for selecting wartime officers for integration into the regu- 
lar Army may result in misunderstanding. In discussing the correlation 
between the Officer Evaluation Report and the criterion of leadership, 
the latter being a product of nominations by subordinates and peers 
with a veto power resting with the commanding officer of the group, 
Jenkins states: A 

. - . The degree to which the Commanding Officers’ ratings were weighted 
in the Officer Evaluation Report was not stated, but it appears likely that this 
factor played an important role. Substantial agreement between ratings by 
the C.O. and by fellow officers was to be expected. Since the OER had the 
highest validity, and the other measures when combined with it increased its 
correlation with the criterion only .07, these questions suggest the necessity for 
a further examination of the nature of the criterion here employed (p. 74). 


The Officer Evaluation Report was accomplished in the majority of 
cases by the immediate supervisor, not the C.O., and represented only 
the former’s evaluation of the ratee. The conclusion that substantial 
agreement between ratings by the C.O. and by fellow officers was to be 
expected does not appear to be justified. The C.O. was only one of from 
7 to 30 nominators who participated in determining the ratee’s criterion 
Standing. He had no knowledge of how the other members of the 
nomination group evaluated each ratee, and his rating was used only to 
eliminate from the criterion groups of High, Low or Middle those rare 
cases where the C.O. placed the rated officer in the opposite extreme 
from the combined ratings of his subordinates and peers. Later studies 
employing a criterion which did not include the C.O. showed no drop in 
the validity of the OER or FCL type rating device. It is our belief that 
the nomination criterion employed in the studies cited does represent 
Progress in the development of leadership criteria. 

With respect to methodology, the forced choice technique as exem- 
plified in the triads and tetrads of the OER and the recently revised 
Army Officer Efficiency Report seems to deserve more attention than 
the reviewer accords it. This technique, which has been described briefly 
in a paper titled “The Forced Choice Technique and Rating Scales, 
presented at the American Psychological Association meeting in Phila- 
delphia on 5 Sept. 1946 by the Personnel Research Section, AGO, not 
Only provides valid indicators of the rates standing on a aoma i 
criterion, but favorably influences ratings of overall competence (if ad 
are made immediately following completion of the FCL items) $ t i 
they show substantially less negative skewness. Clearly, the T 
choice technique is effective in diminishing rater-bias and in De 
the distribution and validity of ratings which are generally regar 


indicative of leadership performance. 
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Monn, Norman L. Psychology: The fundamentals of human adjustment. 
Boston: Houghton Mifflin, 1946. Pp. xviii+497. 


The importance of the introductory course in psychology cannot be 
overestimated for it determines to a great extent the student's attitude 
toward the subject and whether or not he goes any further with it. But 
the importance of the textbook depends to a large degree upon the in- 
structor. Some instructors lean heavily upon the text, others hardly 
at all. In reviewing a book, however, evaluation must be made as if it 
were the sole source of the student’s introduction to the subject, re- 
gardless of the instructor's predilections, interpretations, choice of ma- 
terial, or method of handling the course. Though there are suggeste 
readings at the end of each chapter in this as in other texts which the 
student is urged to consult, their influence is admittedly minor since the 
author of a text, as Munn says, writes with the feeling that he, in com- 
mon with most teachers of the subject, could “organize its topics 17 a 
more logical sequence, choose apter illustrations, find more interesting 
examples and... write a book that... would be more appealing t° 
instructors and students than any he has seen” (ii). Some other re- 
quirements of a good introductory text are succinctly suggested by 
President Leonard Carmichael in his Introduction wherein he discusses 
the reasons for studying Psychology today: as an essential part of ® 
general education; as preparation for professions like law, medicine, 
teaching, the ministry, and business; and for further professional wor! 
in the subject itself. That Munn has met both his own and the editor ê 
demands with considerable success there can be no question. The boos 
is plentifully provided with excellent diagrams, half-tones, and tabu- 
lar matter; it is full of concrete material chosen from a wide variety ° 
sources; and its approach is scientific throughout. 

It would seem, in the light of these virtues, that this text meets the 
requirements of the introductory course almost to perfection. It is SUC 
a splendid job in so many respects that any criticism at all seems SUP” 
erogatory, if not hypercritical; and yet there are qualities expected 19 an 
elementary text which are of equal or greater importance than the ones 
which this book has in such large measure. The most important lac uF 
an underlying point of view or theoretical structure integrating 2” 
unifying the topics and their relations to each other and the subject a5 ĉ 
whole. We shall find that in this respect the book is not up to its 2 
complishments otherwise. 

The book is divided into seven main parts and these in turn into two 
or more chapters. It begins with a discussion of general, methodologic? 
and historical material, then proceeds to consider in turn the anatomic? 
and physiological bases of behavior, learning, remembering, thinking, 
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motivation, conflict, feeling, perceiving, the special senses, statistics, in- 
telligence, aptitudes, and personality. The treatment develops by 
consideration of simpler processes followed by the more complex, in so 
far as possible, though there is some back-tracking which is done with a 
minimum of repetition of earlier material. With the general plan of the 
work before us we can now consider it more in detail. 

Beginning with the origin and scope of psychology, the first two 
chapters are devoted to a brief glance at the history of the subject 
through a consideration of such topics as the psyche, the organism, 
methods in philosophy, physiology, and physics, analysis of conscious- 
ness and some fields of psychology. Chapters 1 and 2 really constitute 
a single topic or set of topics and furnish an excellent survey of methods, 
fields, and problems. They are properly brief, to the point, and very 
readable. Only in one detail does the text here need emendation. In 
discussing scientific controls it is stated that ‘‘there is never more than 
one independent variable in a given experiment .... If two or more fac- 
tors were varied, he (the scientist) obviously would not know which had 
produced the phenomena observed” (p. 23). While it is not expected 
that the logic of analysis of variance and designed experiments should 
be presented at the elementary level (though it is not impossible): ad- 
vances in statistics have made the older Mill-Bacon canons of scientific 
procedure represented by this statement quite out-of-date. Variation of 
only a single variable in psychological experiments is possible so seldom 
as to be almost a fiction and now that we have the statistical tools for 
handling multiple variates we might as well give up the fiction. 

Part 2 deals with psychological development and consists of three 
chapters: origin and psychological significance of response systems, con- 
ception to maturity, and factors in psychological growth. Here the 
biological bases of behavior are explained and the psychological proc- 
esses most directly correlated with them are brought in, The result is 
that the simpler and more complex processes are more or less inter- 
mingled in these chapters as a partial list of the topics reveals: structure 
and functions of receptor and nervous systems, embryonic development, 
Sensitivity, locomotion, prehension, language, gestures, writing, genes, 
heredity, environment, and maturation. The order is in general from 
simple to complex but there are some reversals. Thus one would expect 
a discussion of genes and embryological development before discussion 
of the nervous system but here it follows the latter. The reason for 
Munn’s order is obvious to 2 reader of the book and a good one: dis- 
cussion of the more elementary gene units links directly with problems 
of heredity, environment, maturation and growth. There is no discus- 
sion of nerve action potentials and the treatment of the autonomic 
Nervous system is postponed to the chapter on emotion where diagrams 
illustrating its relations to the cerebro-spinal system are given. The 
reviewer finds it impossible to omit the autonomic nervous system when 
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explaining the rest of the nervous system, although like Munn, he finds 
greatest use for it in the discussion of feelings and emotions. Figure 13 
in chapter 3, showing the spinal reflex-arc system, would not be over- 
complicated if the sympathetic ganglion and its fibers were included as 
is usually done, and with some textual discussion this would remedy a 
serious omission at this point. On the other hand, Munn has included 
more material on the nervous system than is generally presented. The 
diagrams showing different types of synaptic connections make inter- 
action, facilitation, and inhibition intelligible neurologically. The dis- 
eee of cortical representation of sensory functions is especially well 
one. 

The chapters on conditioning, learning, memory and thinking suc- 
ceed in presenting a considerable amount of material but suffer from the 
lack of clear integrating principles. While Munn rejects classical, 
Pavlovian conditioning theory as an adequate account of all learned 
behavior it is not clear what principles he would employ instead. That 
the author places greatest reliance on trial and error, past experience, 
and association appears from his treatment of certain particular prob- 
lems rather than from explicit structuration of the material. One must 
dig his underlying approach out from a few critical cases which reve? 
the author's fundamental position. Thus the explanation of how the 
chimpanzee reaches an apparently inaccessible object is a case in point. 
According to Munn we are to suppose “...a chimpanzee has, in the 
jungle, learned to reach otherwise inaccessible objects by swinging 
toward them on a vine. Now in the psychological laboratory, he 1s 
confronted with an apparently inaccessible banana. A rope, howeve", 
is hanging nearby. If the animal sees the similarity between the rope a” 
the vine, or between his jungle method and the one now possible, he may 
solve the problem immediately” (p. 122). (Italics are the reviewer's.) OW 
this explanation of the ape’s accomplishment in terms of past experience 
which at first sight seems to be the scientifically simplest explanation 
actually turns out on closer scrutiny to demand much more in the way 
of memory and intellectual ability than the proposition that the anima 
simply sees the relevance of the rope to the banana which is immediately 
given. This assumption should not be too difficult to make since it W4° 
pointed out earlier that the difference between classical (mechanica 
and instrumental conditioning lies in the fact that conditioning °C” 
curs much more easily when in the direction of more relevant Te 
sponses. Certainly if the principle of relevance is basic to conditioning 
it may be accepted for the much more complicated case of insight! 
behavior. The welter of factors involved in acquiring skill and learning 
could be better ordered and made more meaningfully connected if some 
structure were seen behind the facts in question. 

The lack of an adequate theoretical framework plagues the reader 
most in the concluding chapter on thinking. Reasoning, we are told, ! 


"| 
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implicit trial and error, it is a form of controlled association, it is a com- 
bining of past experiences in order to solve problems which cannot be 
solved by mere reproduction of earlier solutions. At the same time the 
role of direction in reasoning and recall is emphasized, but how this 
factor operates with trial and error, past associations, and mere repro- 
duction of earlier solutions to problems is not faced. We are here smack 
up against the problem of organization which many workers in biology 
as well as psychology realize cannot be dealt with adequately except as 
a problem in its own right. From the reviewer's experience such prob- 
lems cannot be evaded even in the beginning course because many stu- 
dents have already faced them in courses in philosophy, logic, biology, 
and elsewhere. 

The section on motivation of behavior seems to this reviewer to be 
best in the opening chapter dealing with physiological drives such as 
hunger, thirst, and sex where the material is largely drawn from experi- 
mental sources. The chapter on common social motives reads too much 
like a re-wording of the instinct psychology with too little use made of 
laboratory findings relevant to the topic. The chapter on conflict opens 
with sources of conflict in the environment and in the individual and 
then presents topological representation of conflict situations as an 
‘interesting and illuminating method of representing and analyzing 
conflict situations” (p. 245). However, in the succeeding treatment of 
reactions to conflict such as compensation, identification, phantasy, 
Projection, repression, and experimentally produced conflicts, there is 
no further use made of Lewinian concepts. Again the integration must 


either be made by the instructor or the student will suffer from intellec- 


tual indigestion. Other alternatives are to omit topological representa- 
tion or to put it at the very end of the chapter, pointing out that some, 
much, or most of the material discussed (depending upon the degree to 
Which the instructor knows topological psychology) can be diagrammed 
1n these terms. The author's penchant for trial and error pops out again 
in his recommendation of it as a possible solution in the alleviation or 
cure of conflict, contrary to the usual emphasis on rational procedures in 
Psychotherapy. Since the patient admittedly knows why he is trying 
Various lines of action, namely to find a way out of his conflict, it 1s 
doubtful if the procedure recommended is truly trial and error as Munn 


Says. z 
"The section on feeling and emotion which follows the one on motiva- 
tion of behavior might well have preceded it, as affective states have 
been regarded by almost everyone as motivators of behavior. The main 
ndings in the field are well covered with one or two exceptions. In the 


discussion of the Cannon-Bard theory the inhibitory function of the 
Cortex ig not mentioned an ing the contrasting 


feature i ory the cortico- 
s of this theory as 28 e k 
thalamic inhibitory path is not even shown. In view of the great 1m 
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portance of the role played by the cortex in inhibiting emotion through 
positive inhibitory regulation and in allowing emotional expression 
through release of inhibition, the account offered here is entirely inade- 
quate if not misleading, as reference to Bard’s exposition in the Hand- 
book of General Experimental Psychology, pp. 305-307, will show. Both 
text and diagram need this aspect of the theory for a correct as well as 
complete statement. 

The following section, Knowing Our World, deals with attention and 
the special senses. Munn has here done an excellent job of boiling down 
the classical, and for the most part, stereotyped material and he has 
made it attractive by the use of well-chosen diagrams and half-tones- 
In view of the tremendous use made during the war years and now ° 
material from the fields of sensation, perception, and psychophysics, not 
to mention their interrelations with sensori-motor learning, the time 1S 
past when we can rest content with traditional accounts of these fields. 
There is a wealth of material not yet in any text which modifies the 
whole approach to sensory processes and bears on every other field of 
psychology which should form part of the elementary student's equip- 
ment, Why recent work in some fields finds its way almost at once into 
textbooks and equally important work in other fields must wait a gen" 
eration or more is hard to understand. For example, the explanatio# 
given of constancy is naive in the extreme in the light of not-so-recen 
work, And the epoch-making contributions by Katz find no reflection 
in the treatment of vision even though they had been in print for 
years at the time this book appeared. : 

Several inaccuracies in terminology and fact should be corrected 17 
future editions, such as: brightness and lightness should not be used in” 
terchangeably, and unless film and aperture modes are distinguishe¢ 
is impossible to appreciate their difference; the assumption that Hering 
“neutral gray” is a constant or a general phenomenon rather than 
special case is not tenable in view of work by Koffka and others; 
discussion of retinal mixture versus overlapping of lights is s0 uncle 
it is impossible to determine what is meant and if it is correct; the pide 
explanation of Flor contrast as due to softening or obliteration © cone 
tours is palpably wrong and needs to be supplanted by the correct €% 
planation given by von Bezold in the early part of the present century) 
the interchangeable use of note and tone, so common in discussions © 
hearing, should be replaced by more precise terminology in which tone 
relates to hearing-experience and note to the printed symbol. Only ont 
figure of sensory qualities, the double cone for vision, is given althous 
the smell prism and the taste tetrahedron are just as good in their re- 
spective modalities. The value of 1/3 as the Weber fraction for temper 
ture is altogether too large to be representative following Culler’s vory 

In general the chapters just considered rely too much, 0^ 
theoretical side, on past experience and similar explanations and suffer 


£ 
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from a lack of unifying principles by which intra- and inter-sensory 
material may be related as well as unified with other psychological 
processes. If, as admitted, principles of organization are effective in 
attending, are they not perhaps also of importance in perception, and 
taking a further step, in learning and thinking as well? Recognition of 
such principles might unify and simplify psychology for the beginner. 
The seventh and final section of the book, Individual Differences, 
contains chapters on statistics, intelligence, aptitudes, and personality. 
The chapter on statistics, kept until this section as an “Introduction to 
Statistical Analysis of Individual Differences,” might well come earlier, 
especially for use in courses with laboratory. However the chapter can 
be introduced as it stands in almost any part of the course so its actual 
Position matters little. The other three chapters form a fitting close to 
the book, entirely in the spirit of the more experimental portions in 
being packed with concrete material. Intelligence is approached from 
the historical angle and the important question of heredity and en- 
vironment is quite fully discussed. The discussion of factor analysis— 
including the fundamental factors found by Thurstone, and the illus- 
tratory material from test batteries—make this chapter unique for an 
elementary presentation and one of the finest things in the book. 
Similarly the chapters on aptitudes and personality are extremely 
Well presented and again demonstrate the author's ability to condense 
a large amount of fact into a relatively small compass. In the chapter on 
Personality the discussion includes methods of approach such as case 
history, rating, paper and pencil tests, behavior tests, interviews, free 
association, dream analysis, and projective methods, and also physique 
and temperament, role of the endocrines, and abnormal states. The 
Open, empirical treatment here is more acceptable because the subject 
is more familiar to the average student and personality as a concept 
already provides some structuration by which its data can be ordered. 
Taking the book as a whole, what are its pros and cons? On the plus 
side it is an excellent text in so far as it provides a wealth of concrete 
actual data chosen from widely different sources both within and out- 
Side psychology proper. With some exceptions it represents present-day 
Scientific psychology very fairly. The student should come aways from 
is text with respect for the scientific approach not because he has been 
told that it works best in various fields but because he has found that 
Material obtained by scientific methods can be applied to many different 


Ne situati leads to further fruitful discovery. If;the author is 
mee io his criticism is so mild and fair that the 


ell as for psychology in general is in 


Qe Wise diminished. 
eae a very good effect on 
ten personal or institution 
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detrimental both to themselves and to psychology. This book should 
serve as an excellent corrective to this tendency. 

On the negative side of an otherwise fine piece of writing and presen- 
tation must be noted the lack of an integrating and unifying point of 
view which has been pointed out in our previous discussion. This lack 
results in a looser and more disjointed treatment than is necessary 1n the 
light of present advances along various fronts. This is not meant to 
imply that Munn himself does not have a point of view. As we have seen, 
careful reading reveals that for him trial and error is the great principle 
operating in human behavior and a number of indications are present 
that he believes in what has been dubbed an “atomistic logic,” 4.654 
proceeding from “simples” to “complexes.” But having brought into 
his discussion of conditioning the principle of relevance, into thinking 
the principle of direction, into some sensory experiences primitive or- 
ganization, and having recognized other whole-properties as well, he 
is under obligation to apply them more generally where they are appli- 
cable or at least to square them with the fundamental principles he 
believes are operative. Perhaps he has done this and this reviewer has 
missed it. If so, then it is probable that most students will fail to see 
how it all goes together. 

The tendency to over-simplify has already been pointed out with 
reference to certain neural diagrams but it occurs much more frequently 
in the textual discussion where it leads the author, in making points 
which are quite valid in themselves, to say things he cannot possibly 
mean as they stand. For example, in writing about the influence of 
animal experiments on the theoretical basis of psychology, there ap- 
pears the remarkable statement: “After all, learning is learning and vi- 
sion is vision whether it occurs in man or animal” (p. 10). But the vision 
of the most widely used laboratory animal, the white rat, is very differ- 
ent from that of man, from retina to higher cortical centers, and Munn 
later points out that “Insight is rare in animals, not quite so rare in 
children, and quite common in human adults” (p. 109), meaning to 
distinguish among kinds of learning. One finds too many statements 
like this which take a good deal of explaining to mitigate. 

There can be little doubt the present book will set a pattern for 
future introductory texts. The double columns while providing a shorter 
reading line and more words per page also make possible wider spaces 
for illustrative material and marginal notes. The wealth of charts, 
diagrams, and pictures lessens the instructor’s blackboard work and 
should prove a boon to places where laboratory work cannot be given. 
In this text psychology appears as a positive, if not positivistic, science. 
If it were possible to combine what Munn has done with more emphasis 
on methods and unifying principles we should be much nearer the per 
fect presentation of present-day psychology everyone desires. 

Harry HELSON: 

Bryn Mawr College. 
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BRIDGES, J. W. Psychology normal and abnormal. Toronto: Sir Isaac 
Pitman & Sons, 1946. Pp. xviii +470. 


Except for dropping the chapter on philosophical foundations, the 
splitting and partial revision of the chapter on reflexes and instincts, and 
the annotation of the extensive bibliography, the 1946 edition has 

identical twin” resemblance to its 1930 predecessor. The appearance 
of the revision does, however, call attention again to a book with a 
classical timelessness of integration (despite an eclectic tolerance), 
stimulating hypotheses, and a style reminiscent of William James. The 
general reader and even the professional psychologist will find interest 
and value here, although the book was designed for the introductory 
psychology course of pre-medical and medical students. 

Bridges chooses to give the distilled essence of a topic rather than to 
lead the reader to a conclusion from the raw data of experiments and 
case studies. The few graphs, tables, and other reference to specific 
studies are illustrative only. Just as the dramatist’s words furnished the 
bare Elizabethan stage, the emphasis on the logic of the argument in 
this book seems to stimulate more associations and imagery than one 
gets from many texts replete with illustrations. Many more of the quo- 
tations are from English and French psychologists than one meets in 
most American texts. 

The chapter headings might have come from any of a number of 
general psychologies, but the plan of devoting the first half of each 
chapter to normal behavior and the remainder to the related abnormali- 
ties makes a distinctive pattern throughout the book. Technical words 
are italicized and well defined. The chapter on applied psychology 
delimits that field with such precision and perspective that it ought to 
be widely reprinted. 

An error not corrected from the 1930 edition is the taking of the 
standard deviation from the median. Emphasis on the older studies, 
e.g., Downey will-temperament tests, is heavier than on those of the 
last two decades. 

The problem of how to teach psychology in medical schools or to 
pre-medical students seems to have led in at least three main directions. 
Some have emphasized a sociological-psychological approach, as in 
Pressey’s Life, since this is a common medical blind spot. Others have 
stressed the genetic-psychosomatic attitude, as found in such authors as 
Maslow and Mittlemann. A third group believe that medical students 
are more highly motivated and gain more insight from the contrasts and 
comparisons of the normal and abnormal. Bridges from his experience 
as the first professor of abnormal psychology on a medical faculty has 
provided an effective text for the last group. 

Grorce M. HASLERUD. 

University of New Hampshire. 
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Gray, J. S. Psychology in human affairs. New York: McGraw-Hill, 
1946. Pp. viii+646. 


While this book is, in many respects, a successor to Gray’s pre- 
viously edited text, Psychology in Use (American Book Co., 1941), in 
that it discusses the applications of psychology to the main fields of 
practical life, and represents the co-authorship of eleven other con- 
tributors, nevertheless it is not merely a revision. With two excep- 
tions, the co-authors are new. They are less well known than those of 
the former book, but Gray has himself taken a more active part in the 
actual writing of the text. Several chapters appear for the first time, 
such as ‘Psychology in Speech Correction,” “Psychology in Music, 
Art, and Literature,” and “Psychology in Military Affairs.” Others 
appear under new titles, and are written from a new viewpoint. 

Perhaps the outstanding characteristic of the book is its emphasis on 
factual material. For example, Chapter II, on ‘‘Psychology in College 
Life” contains twenty tables and four graphs. Chapter III, on “Child 
Development” contains ten graphs and twenty-one tables. Much of 
this material is new to textbooks, and with few exceptions, the refer- 
ences are to studies published after 1930. The general effect of this 
emphasis on experimental data and practical findings is to require a 
change in teaching methods on the instructor's part. His function is no 
longer to supplement the text with up-to-date illustrative material, but 
rather to interpret and evaluate that which is given. Less supplemen- 
tary assigned reading is needed, and much more digesting of the text by 
the student. The art of reading and interpreting tables and graphs is 
one which requires special training. Many students are allergic to sta- 
tistics, though this is not in itself an argument for using them sparingly, 
if the instructor is competent to vitalize them. But the chief value of 
facts is to illustrate and support laws, principles and theoretical formu- 
lations. They are most effectively used in the inductive development of 
atopic. Most of these facts are and should be promptly forgotten by the 
student, so that his memory is freed for the permanent retention of the 
principles. The immature student needs much expert guidance in rec- 
ognizing the bare essentials of fact to be learned. While this book is 
many strides ahead of the type which presents only unsu pported asser- 
tions, or illustrations a for their patness only, does it err slightly 
in the opposite direction a. n 
in oe important characteristic of the book is its emphasis on the 
practical. This is to be expected in an applied psychology ered 
seldom achieved. Omnibus books too often give an impression ot sketchy 
remoteness, with little practical contact, while technical treatises 
are written for the advanced student who wants specialized Soim 

This book, by achieving a compromise between these two extr 


by emphasizing the practical aspects of each field for the layman, fills a 
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real need. Its range of topics is wider, its treatment more complete than 
is customary in such books. 

In spite of its up-to-dateness, the book occasionally presents old 
data which have been superseded, or theory which is now modified. In 
one or two cases, quite erroneous statements appear, such as the follow- 
ing, in connection with a discussion of the topic of I.Q. constancy, on 
Pages 91-92: 

If the child develops mentally at exactly the same rate as other children 
tested, his I.Q. will remain constant. However, if his mental development is 
faster than that of other children, his I.Q. will increase. Likewise, if his mental 
development is slower than that of other children, his I.Q. will decrease. 


The author seems to have confused constancy of 1.Q. with normality of 
LQ., for if the statement were taken as it stands, it would mean that no 
child's I.Q. is constant if he has a faster or slower developmental rate 
than the average child. 

An innovation in this book which probably has pedagogical value 
and would be used oftener by authors of texts if publishers would let- 
them, is the table of contents at the beginning of each chapter. The 
addition of page numbers would increase its value. A word must be 
Said in criticism of certain reproduced charts, in which the reduction in 
Size of print needed to get them on the page has made them unreadable; 
for example, those on pp. 474 and 573. Otherwise the style of the book 
is good. 

Many psychology teachers will welcome this book either as a supple- 
ment to the general course in psychology, or as a second course to follow 
the introductory one. Those who teach adult extension classes will find 
lt an excellent survey text, both meaty and comprehensive. 

ARTHUR G. BILLS. 

University of Cincinnati. 


Lucr, J. M., & Hatt, V. E. (Eds.). Annual review of physiology (Vols. 
VII & VIII). Stanford Univ. P.O.: Annual Reviews, Inc. and 
American Physiological Society, 1945, 1946. Pp. vi+774 (Vol. 
VII). Pp. vi+658 (Vol. VIII). 


These are Volumes VII and VIII of the annual series begun in 1937 
and published jointly by the American Physiological Society and An- 
nual Reviews. Itis the declared editorial policy of the Review that “en- 
Couragement is given only to preparation of reviews which survey the 
Important contributions of the preceding year or biennium, which ap- 
Praise them critically, and evaluate with discrimination the present 
Status of the subject. Comprehensive reviews in which the task of the 
author is one of compilation rather than of appraisal are deliberately 
schewed.”” Despite this policy, some of the reviews are principally 
Compilations or annotated bibliographies. And some are rather spotty 
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compilations mixed with evaluation, while only a few reach the goal of 
really critical reviews of the literature. On the whole, however, the 
reader can obtain a picture of the more significant aspects of research 
progress in the respective fields covered by the review. 

Volume VII contains 26 chapters written by 30 authors, and Volume 
VIII contains 25 chapters by a total of 29 authors. In each case, 
bibliographies of literature cited in the various reviews total about 4,000 
references. At the end of each volume is an author index of about 4,000 
items and a subject index of about 40 pages in length. 

Because the reviews are written by physiologists for physiologists, 
more than half of each volume is of no interest to the psychologist except 
that occasionally there is a brief treatment in a sentence or two of some 
psycho-physiological problem. The psychologist, however, who wants 
to find out what has happened recently in some special aspect of 
physiology relevant to his field of teaching or research will find that 
his best bet is to go to these volumes and to consult the excellent author 
and subject indices before resorting to other less up-to-date textbooks 
or to more laborious methods of library research. 

More than that, however, many of the special chapters in physiology 
are good reading for psychologists engaged in the respectively related 
field of psychology: for genetic psychology, Physiological aspects of 
genetics VII and VIII) and Developmental physiology (VII and VIII); 
for sensory psychology, The special senses (VII) and Audition (VIII); 
for neural mechanisms of behavior, Electrical activity of the brain (VII), 
Conduction and synaptic transmission in the nervous system (VIL), Nerve 
and synaptic conduction (VIII), The visceral functions of the nervous 
system (VII and VIII); and for a general review of physiological psy- 
chology, Physiological psychology (VII and VIII). 

The contrast between the chapters on physiological psychology in 
the two volumes deserves a special comment. In Volume VII, Stone 
has presented a careful and critical review. He has covered thoroughly 
the recent literature and has appraised its strength and shortcomings so 
that the reader can see what has happened and what it means for phys- 


iological psychology. 

The same chapter in Volume VIII by Seashore, however, does 

neither of these two things. It opens with a philosophical discussion of 
of the scientific approach to it. The chapter 


the mind-body problem and ach to 
then proceeds to summarize the present status of individual differences 
in skills, abilities, aptitudes and capacities. Finally, it gives a general 


summary of the effects of extreme working conditions upon the effec- 
tiveness of human performance. Thus Seashore’s chapter spends a lot 
of time on problems which are not physiological psychology, in any 
reasonable definition of the field; and he fails to review or appraise the 
recent literature in the field. As a consequence, the physiologist reading 
the two chapters is likely to be bewildered by two so very different co” 


cepts of physiological psychology. 
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Looked at in perspective, these two volumes of Annual Review of 
Physiology, like previous volumes in the series, are an extremely valu- 
able aid, not only to physiologists, but to all those for whom physiology 
is an important ancillary subject. By and large, the chapters give schol- 
arly up-to-date appraisals of their respective fields. As he has stated 
before, this reviewer feels that a companion volume, giving an annual 
review of psychology, would be an invaluable aid to psychologists, which 
would help us “keep up with the literature” and give us better perspec- 
tive on the developments in our field. 

CLIFFORD T. MORGAN. 

The Johns Hopkins University. 


BARKER, ROGER G., WRIGHT, BEATRICE A., AND GONICK, MOLLIE R. 
Adjustment to physical handicap and illness: a survey of the social 
psychology of physique and disability. New York: Social Science 
Research Council, 1946. (Bulletin No. 55.) Pp. xi+372. 


This is another in the excellent series of research summaries spon- 
sored by the Social Science Research Council. The authors have earned 
Special commendation by providing intelligently critical comments as 
to the assumptions and thinking of earlier investigators, rather than 
merely reporting data and conclusions; by writing this summary of 
Prior research into a theoretical frame of reference (topological psy- 
chology); and by introducing some well-chosen original material to 
illuminate the inferences drawn from published sources. 

From their survey the authors have eliminated somato-psychologi- 
cal studies of age, sex, race, and speech defects, on the ground that these 

ave recently been covered adequately by other reviewers. Leprosy is 
discarded as a minor problem in the western world. Of the remaining 
areas, detailed reports are presented on: normal variations in physique; 
‘rippling; tubercular conditions; impaired hearing; and acute illness. 
‘ibliographies are added on: visual disability; cardiac conditions; 
labetes mellitus; cosmetic defect; rheumatism; and cancer. 

The least satisfactory chapter is that on variations in normal phy- 
Sique. There is a good section on size changes at adolescence, but the 
discussion of variations in adult size is rather elementary, and no men- 
tion whatever is made of Sheldon’s The varieties of human physique and 
` he varieties of temperament. Even if one does not accept Sheldon’s 

Cory, his work can hardly be ignored. The authors occasionally dip a 
esitant toe into the cold waters of endocrinology, genetics and autonomic 
Nervous function, then withdraw hastily. Clarity would have been im- 
Proved by frankly excluding such material. 
im Outstanding treatments of crippling, tubercular conditions and 
: Paired hearing more than atone for any shortcomings of the earlier 
oie Particularly interesting is the mode of analysis in terms of 
wine situations. A disabled person is able to function on a par 
normals in some environments; he is decisively barred from other 
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situations; but between these extremes will fall a range of ambiguous 
conditions in which the handicapped person may participate, but under 
difficulties. The Lewinian concepts of barrier, valence, potency and 
congruence are used fruitfully to show basic similarities between the 
situations facing the orthopedic cripple, the tubercular, the deaf and the 
individual with acute illness. 

1 The person with impaired hearing, for example, functions in many 
situations unnoticed by his normal associates. If the behavior involved 
does not require auditory controls, he may compete on equal terms. 
Where hearing is involved, he may be handicapped and subject to extra 
criticism, since his impairment is not obvious and many normals (e.g., 
school teachers) fail to make allowances for it. The barriers in his field 
are indefinite (as contrasted with the orthopedic cripple, for example, to 
whom certain activities are plainly impossible), and this condition often 
gives rise to vacillation and instability. The valence of full-normal ac- 
tivity is positive and high, but the valence of failure and criticism is 
negative and high. Thus physically handicapped persons are likely to 
show the familiar symptoms of conflict. 

We occasionally feel, in these topological analyses, that there is an 
unstated shift from the topology of the external situation (geographical 
environment) to the situation as perceived by the individual (behavioral 
environment). In the case of cripples, for example, some activities are 
objectively impossible, whereas others are subjectively considered to be 
impossible. It is clear that these two should not be treated as identical, 
and yet that impression is sometimes given. If the entire analysis were 
erected on a perceptual basis, this uncertainty could have been avoided. 

The importance of the individual’s perception of his defect, and of 
his behavioral field, is well illustrated by the discussion of family 
attitudes toward the handicapped. Many parents reject the handi- 
capped child, while others over-protect him and keep him in an infantile 
status. Optimum adjustment seems to be achieved when the parents 
adopt an understanding, objective attitude which focuses the child’s 
attention on realistic assessment of the situation. Excessive sympathy 
and pity are likely to encourage exaggeration of barriers and exclusion 
of many possibilities for normal participation. ; à 

The final chapter on employment of disabled persons gives a realis- 
tic and well-considered treatment of this problem, the solution of which 


is basic to optimum adjustment of handicapped adults. 
Ross STAGNER. 


Dartmouth College. 


Lewis, CLAUDIA. Children of the Cumberland. New York: Columbia 
Univ. Press, 1946. Pp. xviii+217. 


Before going to the Southern highlands, Miss Lewis was a teacher in 
the Harriet Johnson Nursery School in Greenwich Village, New York 
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City. She compares the behavior of the children in the nursery school 
which she established in the mountains of Tennessee with the behavior 
of her Greenwich Village pupils. The majority of the material presented 
in the book concerns the mountaineer subjects. , 

A considerable part of the volume consists of a collection of incidents 
involving child care or child behavior. These range in length from single 
Sentences to a page or more, in age of subjects from birth to senility, in 
form from dialogue to descriptive essays. They are extremely readable 
and serve to render very vivid the life of the mountain people. 

Miss Lewis does not attempt to present quantitative measures, but 
for this she cannot be reprimanded. It is apparent that she devoted 
very full days to the nursery school, and that research had a secondary 


Place. Nevertheless, her thinking is quantitative. She emphasizes the 
er of individual behavior which takes place in both schools, and 
makes 


clear that there is overlapping between the schools. However, it 
is her belief that there are large differences in central tendencies be- 
tween her two kinds of subjects. With this contention, it is likely that 
nearly every person who is familiar with both types of children will 
agree, , 

Miss Lewis finds more spontaneity, more energy, more conflict, more 
aggression in the New York group. The mountain children are more 
placid, more compliant, more quiet, and in some respects, better ad- 
Justed. 

She does not find the explanation of these differences in any one 
factor, Among the probable causes mentioned are the following: the 
differences in spaciousness of the environment, differences in climate, 
ealth, and nutrition, differences in sleeping habits, differences in infant 
Care and family structure, differences in discipline, and differences in 
environmental stimulation. 
hroughout the book, Miss Lewis shows an excellent understanding 
Of child development in both New York and Tennessee—not an easy 
achievement, She also displays a high degree of ability as a writer. This 
Combination of traits makes her book one of the best in the “child in 
a culture” field. It should be profitable alike to teacher, parent, psy- 
Chologist, and sociologist. 
he reader may sample for himself some of Miss Lewis’s attitudes 
and style in the following quotation from her concluding chapter: 
t No, now that this study is made Iam not packing my trunks with the in- 
fat of moving down to Tennessee, building me a cabin, taking to the “simple 
doe and rearing my hypothetical children in the way the Summerville families 
Sh For usitis nota question of attempting to turn the clock back in that way, 
ich, indeed, would be as impossible as it would be undesirable. It is rather a 
“ee of trying to bring to Greenwich Village a little more of . . . Summer- 
Merren: a 
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situations; but between these extremes will fall a range of ambiguous 
conditions in which the handicapped person may participate, but under 
difficulties. The Lewinian concepts of barrier, valence, potency and 
congruence are used fruitfully to show basic similarities between the 
situations facing the orthopedic cripple, the tubercular, the deaf and the 
individual with acute illness. 

i The person with impaired hearing, for example, functions in many 
situations unnoticed by his normal associates. If the behavior involved 
does not require auditory controls, he may compete on equal terms. 
Where hearing is involved, he may be handicapped and subject to extra 
criticism, since his impairment is not obvious and many normals (e.g., 
school teachers) fail to make allowances for it. The barriers in his field 
are indefinite (as contrasted with the orthopedic cripple, for example, to 
whom certain activities are plainly impossible), and this condition often 
gives rise to vacillation and instability. The valence of full-normal ac- 
tivity is positive and high, but the valence of failure and criticism is 
negative and high. Thus physically handicapped persons are likely to 
show the familiar symptoms of conflict. 

We occasionally feel, in these topological analyses, that there is an 
unstated shift from the topology of the external situation (geographical 
environment) to the situation as perceived by the individual (behavioral 
environment). In the case of cripples, for example, some activities are 
objectively impossible, whereas others are subjectively considered to be 
impossible. It is clear that these two should not be treated as identical, 
and yet that impression is sometimes given. If the entire analysis were 
erected on a perceptual basis, this uncertainty could have been avoided. 

The importance of the individual’s perception of his defect, and of 
his behavioral field, is well illustrated by the discussion of family 
attitudes toward the handicapped. Many parents reject the handi- 
capped child, while others over-protect him and keep him in an infantile 
status. Optimum adjustment seems to be achieved when the parents 
adopt an understanding, objective attitude which focuses the child’s 
attention on realistic assessment of the situation. Excessive sympathy 
and pity are likely to encourage exaggeration of barriers and exclusion 
of many possibilities for normal participation. 

The final chapter on employment of disabled persons gives a realis- 
tic and well-considered treatment of this problem, the solution of which 


is basic to optimum adjustment of handicapped adults. 
Ross STAGNER. 


Dartmouth College. 


Lewis, CLAUDIA. Children of the Cumberland. New York: Columbia 
Univ. Press, 1946. Pp. xviii+217. 


Before going to the Southern highlands, Miss Lewis was a teacher in 
the Harriet Johnson Nursery School in Greenwich Village, New York 
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City. She compares the behavior of the children in the nursery school 
which she established in the mountains of Tennessee with the behavior 
of her Greenwich Village pupils. The majority of the material presented 
in the book concerns the mountaineer subjects. 

_ A considerable part of the volume consists of a collection of incidents 
Involving child care or child behavior. These range in length from single 
ee to a page or more, in age of subjects from birth to senility, in 

n from dialogue to descriptive essays. They are extremely readable 
and serve to render very vivid the life of the mountain people. 
os ae Lewis does not attempt to present quantitative measures, but 
sa E she cannot be reprimanded. It is apparent that she devoted 
sins yn to the nursery school, and that research had a secondary 
diver: it evertheless, her thinking is quantitative. She emphasizes the 
mikes z of individual behavior which takes place in both schools, and 
is he 5 i that there is overlapping between the schools. However, it 
tween | elief that there are large differences in central tendencies be- 
= ner two kinds of subjects. With this contention, it is likely that 
Some every person who is familiar with both types of children will 

Miss Lewis finds more spontaneity, more energy, more conflict, more 
aggression in the New York group. The mountain children are more 
Placid, more compliant, more quiet, and in some respects, better ad- 
Justed. 

She does not find the explanation of these differences in any one 
factor, Among the probable causes mentioned are the following: the 
differences in spaciousness of the environment, differences in climate, 
health, and nutrition, differences in sleeping habits, differences in infant 
Care and family structure, differences in discipline, and differences in 
Environmental stimulation. fj 

Throughout the book, Miss Lewis shows an excellent understanding 
of child development in both New York and Tennessee—not an easy 
achievement. She also displays a high degree of ability as a writer. | This 
Combination of traits makes her book one of the best in the ‘‘child in 
a culture” field. It should be profitable alike to teacher, parent, psy- 
Chologist iologist. . 7 

iE i es ails for himself some of Miss Lewis’s attitudes 
and style in the following quotation from her concluding chapter: 

No, now that this study is made I am not packing my trunks with the in- 
tent of moving down to Tennessee, building me a cabin, taking to the ‘simple 
lite and rearing my hypothetical children in the way the Summerville families 
do: For usit is not a question of attempting to turn the clock back in that way, 
Which, indeed, would be as impossible as it would be undesirable. It is rather a 
oo of trying to bring to Greenwich Village a little more of . . - Summer- 

ille li 
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LEEPER, ROBERT. Psychology of personality. Ann Arbor: J. W. Edwards, 
1946. Pp. 167. 


The format of this book, resembling that of a laboratory manual or 
workbook, may lead many to overlook this significant treatment of 
personality and mental hygiene. 

The author's organization apparently proceeds from an assumption 
which has increasingly impressed itself on the reviewer in recent years: 
namely, that, in order to have functional significance, any treatment of 
mental hygiene must be based on a consistent theory of personality 
processes. Moreover, such a treatment should not be left among the 
author’s unverbalized potentialities, but should be given systematic 
formulation. It is to this task that the major emphasis of Leeper’s book 
is devoted. 

Leeper’s treatment is mainly an elaboration of the following thesis: 

In general, the term “personality” covers three things: (1) the person’s 
motives, and especially his emotional motives, or ways in which he responds 
emotionally in different life situations; (2) the general techniques by which, 
characteristically, he tries to attain satisfaction for these motives; and (3) the 
background of meanings or pictures of reality which determine the motives and 
types of adjustive responses of the person (p. 5). 


The discussion of motivation distinguishes between physiological 
and emotional motives and between positive can negative emotional 
motives. While the latter distinction seems arbitrary, since the ‘‘nega- 
tive” motives can be regarded as the products of frustration of the 
“positive” motives, it serves a useful expository purpose when the au- 
thor deals with motivational differences existing between well adjusted 
and poorly adjusted personalities. 

The techniques by which the person tries to attain satisfaction for 
his motives are considered from two points of view: (1) the nature of the 
learning processes, and (2) the description of effectual and ineffectual 
adjustment techniques. The learning processes are treated with due 
attention to the dynamic complexities recognized in modern learning 
theory. In addition to describing the usual techniques employed by 
maladjusted personalities, the author discusses some of the major 
techniques by which superior personalities distinguish themselves. 

In his discussion of the “background of meanings,” the author deals 
competently with an aspect of behavior which seldom receives the em- 
phasis merited by its significance for personality dynamics. Leeper's 
thesis is that “ ... a person cannot govern his behavior just by what 
is objectively and actually true, but . . . must forever live and react in 
terms of properties which he infers as existing because of his experience 
in previous situations” (p. 92). This view that behavior is determined 
not as much by the character of objective reality as by the individual’s 
interpretation of reality (through the phenomenon of emotional trans- 
ference) is supported in terms of the principle of equivalence of stimuli 
and the principle of substitute response or displacement. Treated in 
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these terms, the “background of meanings” is seen to be an aspect of 
personality whose importance has been emphasized by such widely 
separated disciplines as the research of animal psychologists and the 
clinical observations of psychoanalysts. 

It is the reviewer’s opinion that through the medium of Leeper’s 
book the principles of personality functioning are made understandable 
to the average undergraduate without undue simplification or loss in 
organic quality. For this reason, it seems regrettable that the book was 
not produced in a form more likely to have wide distribution. 

The book will probably be disappointing to those who feel that a 
textbook should serve as a compendium of psychological research find- 
ings. While the author draws freely upon research sources, these tend 
to lose their distinctive identities in the author's discussion. No bibliog- 
raphy or index is provided. 

Bert R. SAPPENFIELD. 

Montana State University. 


KerLiry, Douctas M. 22 cells in Nuremberg. New York: Greenberg, 

1947. Pp. 245. $3.00. 

The author of this book was for five months the official psychiatrist 
at the Nuremberg prison and in that capacity made psychiatric exami- 
nations af all the 22 top-ranking Nazi prisoners. The customary medical 
and psychiatric procedures were supplemented by Rorschach personal- 
ity tests and Wechsler-Bellevue intelligence tests given by the author’s 
fellow officer, Capt. G. M. Gilbert. The examinations and tests were 
further supplemented by information obtained from former intimate 
associates of the accused and from motion pictures, speeches, writings, 
and other records. 

Except for Rudolf Hess and Hermann Goering, this was the first 
Psychiatric study to be made of any of the accused. In view of the fact 
that twelve of the group are no longer living and that all but three of the 
others were disposed of by prison sentences of from ten years to life, the 
documents and conclusions of Dr. Kelley are destined to be of lasting 
historical interest. 

The task which the author set himself was not merely or chiefly to 
determine the degree of mental responsibility of the subjects, but rather 
to investigate their basic personality patterns. He wanted to find out 
what these men were like who had made themselves masters of eighty 
million people, and what factors in childhood, youth, and later years 
had made them what they were. The book attempts to answer these 
Questions in language sufficiently nontechnical to be intelligible to 
readers who are relatively unfamiliar with esoteric theories of personal- 
ity. We are informed that more detailed reports of the work will be 
Published later in professional journals, and that transcripts of the in- 
terviews and other records will ultimately be available to historians. 

Among those to whom most space is devoted are Goering (27 pages), 
Hess (22 pages), Ley (21 pages), Rosenberg (13 pages), and Streicher 


484 BOOK REVIEWS 


(11 pages). The other 17 members of the group get from three to eight 
pages each. By good luck the examination of Ley had been completed 
before he committed suicide, and we are told that the post-mortem 
examination of his brain confirmed the psychiatric diagnosis. 

With the exception of one chapter, the book is concerned entirely 
with the 22 Nazis who were studied first-hand by the author. The 
additional chapter presents a 35-page portrait of Hitler based on infor- 
mation and comments obtained from the Führer’s contemporaries, his 
aides, his personal physicians, and his secretaries. Some of this infor- 
mation is new, and the author’s interpretations differ in several impor- 
tant respects from those which have been current. 

Within the limits of a brief review it is not possible to summarize the 
author’s interpretations of the individual subjects. In fact, each por- 
trait as sketched is a unified gestalt that almost defies further condensa- 
tion. The sitters for these portraits composed indeed a motley group. 
They ranged from the stupid to the highly intelligent; from the semi 
insane to the stable and well integrated; from the shrewd and talented 
leader to the errand-boy hanger-on seeking in Hitler a father surrogate. 
But there were three characteristics which they had in common: in- 
ordinate ambition, debased ethical standards, and a hyperdeveloped 
nationalism that justified anything done in the name of Germandom— 
plus, of course, an economic and political environment that allowed full 
play to their ruthless wills. 

The author’s conclusion is that Nazism was a “socio-cultural dis- 
ease,” epidemic among our enemies but endemic everywhere. He tells 
us that the Nazi leaders were not the rare and spectacular types that 
can be expected to appear only once in a century. Instead, neurotics 
like Hitler, with ‘‘hysterical disorders and obsessive complaints, can be 
found in any psychiatric clinic.” Similar ones, thwarted and discour- 
aged, but determined to do great deeds, roam the streets of every Ameri- 
can city. ‘‘Strong, dominant, aggressive, and egocentric personalities 
like Goering ...can be found anywhere—behind big desks deciding 
big affairs as businessmen, politicians, and racketeers.’’ We hardly need 
to be reminded that men strongly resembling some of these types oc- 
casionally win election to our highest law-making bodies or to the gover- 
norship of a great state. h N 

Dr. Kelley has analyzed for us 22 types of totalitarian-virus, has 
described the soil in which they thrive, and has indicated some of the 
means by which society can protect itself against them. His book will 
inevitably be compared with one written by another psychiatrist— 
Brickner’s Is Germany Incurable? Of the two, the reviewer finds Kel- 


ley’s less controversial and no less challenging.* 
Lewis M. TERMAN. 


Stanford University. 


* Since this review was written, Nuremberg Diary, by Capt. G. M. Gilbert, has been 
published. This book should be read along with Dr. Kelley’s. L.M.T. 
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THE HISTORY AND PRESENT STATUS 
OF THE LAW OF EFFECT! 


LEO POSTMAN 
Indiana University 


Throughout history those concerned with the control of human be- 
havior—parents and educators, businessmen and lawmakers—have 
acted on the belief that rewards and punishments are powerful tools 
for the selection and fixation of desirable acts and the elimination of 
undesirable ones. This commonsense view of the nature of learning 
has not received undivided support from the professional students of 
behavior. Almost forty years after the first enunciation of the law of 
effect as a formal doctrine of learning, the problem of reinforcement is 
still the subject of heated controversy among the proponents of rival 
theories of learning. The question has been asked time and again 
whether this conflict is irreconcilable (47, 142). Perhaps there is some 
truth in all, or at least some, of the conflicting views, and the main source 
of difficulty may be the proneness of many theorists to attempt an 
explanation of all learning in terms of their favorite hypotheses. After 
years of debate, the possibility of a one-principle theory of learning— 
whether this principle be contiguity, substitution, reward, or expec- 
tancy—is still doubtful. This doubt is greatly strengthened by a survey 
of the history of the law of effect and an evaluation of its present status. 


HISTORICAL SOURCES? 


Even though the law of effect is invariably associated with the name 
of Thorndike, who first used this phrase (264), the principles embodied 
in the law have a long history antedating by many years the modern 
laboratory studies of learning. The formulation of the law reflected 


1 The writer wishes to express his great indebtedness to Professor Gordon W. Allport 
Who first suggested this review. His unceasing interest and generous advice have been 
valuable. 

2 The law of effect was previously reviewed in this journal by Waters (312). The 
Writer has greatly benefited from this discussion. For the sake of a unitary presentation 
Some of the problems raised by Dr. Waters are discussed again in this paper. 


489 


i 


A 


- much alive when t 


490 LEO POSTMAN 


the impact on psychology of three major trends in the history of 
thought: associationism, hedonism, and evolutionary theory. 
Associationism. Ever since Aristotle first wrote of the laws of associa- 
tion of thoughts (similarity, contrast, and contiguity), many philoso- 
phers and psychologists have treated the problems of learning, memory 
and coherent thinking in terms of associations formed among mental 
Hobbes, Locke, Hartley, Hume, Mill, Wundt, and many 
bed to the doctrine of association of ideas. Some pinned 
their faith on the principle of contiguity alone; others introduced simi- 
larity as a basic law; some writers, such as Hartley, suggested physio- 
logical foundations for the streams of associations. But in spite of diver- 
or that aspect of the doctrine, they all agreed that 
om the joining and weaving together 
of elements—for a long time the term idea predominated—into the 
content of consciousness. Associationism is, almost by definition, ele- 
mentaristic and connectionistic. It is elementaristic because it needs 
conceptual elements which get associated. It is connectionistic because 
it deals with the principles of connection among these elements. These 
two basic characteristics of associationistic thought are made explicit 
in the law of effect. 
Hedonism. Associationism is concerned with the laws of connection 
of mental elements: it has little to say about the role of motives in 
the acquisition of learned responses. Yet the study of learning cannot 
proceed long before coming face-to-face with the problem of motivation. 
In its approach to motivation, modern learning theory has deep roots 
in the philosophy of hedonism, and the development of the psychology 
of learning has been characterized by a stubborn defense of hedonistic 
principles on the one hand and a struggle for the emancipation from 
hedonism on the other. To the hedonist, pleasure and pain are the 
governing principles of behavior. The search for pleasure and the avoid- 
ance of pain are the mainsprings of conduct, individual and social, and 
the basis of social interaction and organization. The idea is very old 
indeed. Plato saw in pleasure and pain important motives of human 
action; Aristotle called them the basis of the will. Hobbes’ interpreta- 
tion of social life revolved around man’s sceking of pleasure and avoid- 
ance of pain. The pleasure-pain principle finally found its most system- 
atic expression in the doctrine of utilitarianism (Bentham) which 
regarded self-interest as a sufficient principle to account for most of 
individual and social action. The pleasure-pain principle was still very 
he law of effect was formulated. 
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and hedonism more closely together. The problem of adaptive behavior 
became central. Some responses to environmental stimuli are superior 
to others and lead to the survival of the fittest. The question at once 
arises how these superior adaptive responses are selected from the mul- 
tiplicity of responses of which an organism is capable, and then fixated 
and perpetuated. To those who tried to answer this question, hedonism 
and the pleasure-pain principle provided the principle of selection, 
and the laws of association the mechanism of fixation. 


EARLY FORMULATIONS OF A Law oF EFFECT 


The convergence of associationsim, hedonism, and evolutionary doc- 
trine is exemplified in the formulation of what is in substance a law of 
effect by Spencer (250) and by Bain (8). Their theories have been 
reviewed and analyzed in detail by Cason (33) but since they embody 
Salient features of the effect doctrine they will be briefly summarized 
here, Spencer’s (250) basic assumption is that in the course of natural 
selection there has been established in the various species a correlation 
between the pleasant and the beneficial, and a similar correlation be- 
tween the unpleasant and the injurious.? That which is pleasant is main- 
tained and repeated and proves beneficial to the biological organism. 
That which is painful is abandoned and the organism is protected from 
injury. Spencer then described a physiological mechanism for the se- 
lection of useful pleasant acts and the elimination of harmful unpleasant 
acts. Organisms respond to environmental stimuli in a highly variable, 
essentially random fashion with diffuse discharges of neural energy. 
Among these random responses there will be sooner or later one (ac- 
Cidental) response which is successful (for example, produces food for the 
animal). After success will immediately come pleasurable sensations 
With a large discharge of nervous energy toward the organs engaged 
1n the successful act, e.g., in eating. This heightened discharge of ner- 
Vous energy will render the successful channels of muscular action more 
Permeable. On recurrence of the circumstances the neural discharge 
Will no longer be diffuse but will be channeled into the successful move- 
ments, With every repetition the successful channels will be made more 
and more permeable, until stable nervous connections have been gr- 
ganized. 


* This assumption is echoed in James’ Principles where we read: “If pleasure and pains 
have no efficacy one does not see why the most noxious acts, such as burning might not 
give thrills of delight, and the most necessary ones, such as breathing cause agony” 

'32, p. 143). Another exponent of a pleasure-pain theory of learning was J. Mark Bald- — 


Win (9). See also Pyle (215). 
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Bain (8) formulated a similar theory: spontaneously begun move- 
ments which are accidentally successful cause pleasure, and with the 
pleasure there is an increase in vital energy. A few repetitions of this 
coincidence of pleasure and movement lead to a neural connection so 
that pleasure or the idea of the pleasure will evoke the successful move- 
ment at once. In the case of pain, the sequence is the reverse, leading to 
a decrease of vital activity and a blocking of movement. 

The theories of Spencer and Bain have been summarized in some de- 
tail here because certain salient features of the pleasure-pain theory have 
proved extremely persistent in later developments: 


1. The organism is said to respond to the problem situation by random 
movements and spontaneous discharges of the muscles. Later analyses of trial- 
and-error responses have been attacked because they seemed to imply that an 
animal's initial responses in a problem situation are random (148). This would 
be an extremely hazardous assumption to make, for in a problem situation an 
animal will rarely respond completely at random; even his trial-and-error re- 
sponses always have a certain directionality though they may at first appear 
random from the observer’s viewpoint. Thoughtful analysts of trial-and-error 
learning have recognized this fact and pointed out that the assumption of 
randomness is not necessary for a trial-and-error theory of learning (240). 

2. The causal efficacy of pleasure and pain in fixating successful responses 
and eliminating unsuccessful ones is explicitly asserted. Pleasure and pain are 
purely psychic concepts and mechanistically inclined thinkers have been re- 
luctant to endow them with such power over muscular responses (29, 33, 112). 
Hence the need to invoke physiological correlates of pleasure and pain to reduce 
their action to mechanistic principles. This need was felt not only by Spencer 
and Bain but also by later writers continuing the tradition of a pleasure-pain 
theory of learning (252, 264, 297). Most of these physiological explanations 
have been highly speculative, as are certainly those advanced by Spencer and 
Bain. There is, for example, no evidence to support the contention that pleasure 
is accompanied by heightened neural activity and pain by lessened activity. 
This may sometimes be the case but the opposite may be equally true, espe- 
cially in the case of intense pain. The trouble was thata pleasure-pain theory was 
really a theory of psychic causes leading to physical effects. Physiological ex- 
planations have been largely ad hoc and invoked for the sake of consistency 
with basic axioms about the mechanistic nature of the learning process.‘ 

3. The third feature which the pleasure-pain view bequeathed to modern 
learning theory is its insistence that a repetition of a successful response will 


4 The vexatious issue of hedonism was sidestepped by Hobhouse (110) who described 
effect in terms of confirmation and inhibition. Similarly Holmes (114) speculated that 
the congruity or incongruity of an act with the activity in progress constituted its effect. 
Such formulations, however, only postpone the consideration of the role played by satis- 
faction and annoyance. As Stephens (257, 258) has pointed out, if an organism unex- 
pectedly stumbles on a valuable outcome, the failure of the response to confirm the 
expectancy will not prevent the response from recurring. Similarly, a congruous eX 
pected response may be eliminated if it leads to unpleasant consequences. 


Pr- 
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strengthen it. There are many cases where this is true but probably equally 
many where the opposite is equally true, where continued repetition of a 
response will weaken that response (59). 

4. Let us finally note the central emphasis which a pleasure-pain theory 
Puts on the motor (movement) aspect of the connection between situation and 
response, It is the movement which is strengthened by pleasure and increased 
neural activity. Later formulations of the law of effect, especially in conjunction 
with conditioning principles (127) have placed equal emphasis on the strength- 
ening of afferent-efferent connections by reinforcement. Critics have insisted 
that movements as such are rarely learned and used by animals only as means to 
an end. It is probably safe to say that changes in the stimulus as the animal 
berceives it and interprets it (3, 29, 30) are just as essential as the fixation of suc- 
cessful movements. 


Tue NEW AssociaTIONIsM: THORNDIKE’s LAWS OF LEARNING 


The combination of a modified pleasure-pain (success-failure) 
Philosophy and connectionism found its most systematic and challeng- 
ing expression in the work of Thorndike. Even though he later dis- 
claimed hedonism (273), Thorndike's thinking has reflected that of the 
earlier exponents of the pleasure-pain principle as well as the associa- 
tionist doctrine. His central importance derives from the fact that he 
embarked on a monumental series of experiments, extending over half 
a century, to obtain empirical verification of his laws of learning. Partly 
on the basis of his study of the problem-solving behavior of animals, 
Thorndike formulated two basic laws of learning: the law of exercise 
and the law of effect. 

Other things being equal, the Jaw of exercise makes learning a function of 
the number of repetitions of the stimulus-response connections. 

The law of effect was stated as follows: “Of several responses made to the 
Same situation, those which are accompanied or closely followed by satisfaction 
to the animal will, other things being equal, be more firmly connected with the 
Situation, so that when it recurs, they will be more likely to recur; those which 
are accompanied or closely followed by discomfort to the animal will, other 
things being equal, have their connection with the situation weakened so that, 
when it recurs, they will be less likely to occur. The greater the satisfaction or 
discomfort the greater the strengthening or weakening of the bond” (264, p. 
2 More succinctly expressed, the law as originally stated asserts that suc- 
cess stamps in and failure stamps out. 


In the light of careful experimental analysis Thorndike later aban- 
doned the law of exercise, or at least he relegated this law to a very 
Minor position, and for him the law of effect became altogether central 
to the learning process (267, 269). At various times Thorndike’s state- 
na of the law of effect have been accompanied by an exposition of 

S probable physiological correlates. The physiological locus of reins 
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forcement by effect is put squarely in the neurons (e.g. 264, 267, 269, 
270, 273). Satisfying and annoying processes are “related to the main- 
tenance and hindrance of the life processes of the neurons” in whose 
changing synaptic conduction learning consists (264, p. 24). In other 
words, pleasure or satisfaction facilitates synaptic conduction, renders 
synapses more permeable, whereas discomfort or pain blocks synaptic 
conduction. 

The subsequent history of the law of effect has shown two major 
foci of interest. 

1. The first major focus of interest has been Thorndike’s law itself. Ever 
since its first formulation it has been subjected to a long series of attacks, at- 
tacks which were vigorously countered by Thorndike and his associates both on 
the level of theoretical discussion and in the experimental laboratory. In the 
course of these discussions Thorndike himself has shifted his theoretical position 
to some extent and reformulated the law. Concurrently many of the important 
parameters of reinforcement (such as amount of reward, frequency of reward, 
delayed reward, spread of effect, etc.) were investigated. 

2. A second focus of interest has developed around the integration of the 
law of effect with the facts and theories of conditioning. In Hull’s systematic 
theory of learning (127), the law of effect occupies a central position. On the 
other hand, the law of effect has been singled out for attack by those who are 
opposed to a conditioning-reinforcement type of learning theory (e.g., Tolman 
and his associates: 289, 294, 315). 


The law of effect has thus become an issue around which systematic 
differences in behavior theory have been crystallized. Defense and 
condemnation of the law of effect have almost become symbolic of 
widely divergent approaches to the problems of behavior analysis and 
learning theory. The discussion which follows will be centered around 
these two foci of interest and will also attempt to cover a representative 
selection of studies dealing with the parameters of reinforcement. 


THE CLASSICAL OBJECTIONS TO RETROACTION 
ation of the classical objections to the 


law of effect. In the language of the law of effect, the consequence ofa 
response strengthens the response, i.e., the effect “works back” on the 
connection which it follows. Such a formulation involves the assumption 
of retroaction. Critics asked, how can an effect work upon a response 
which is already passed (29, 33, 46, 212, 219)? The logical difficulty is 
obvious, and Thorndike has not been inclined to minimize it. He has 
suggested that the physiological equivalent of the connection does not 
vanish instantaneously but is still present when the consequence (re 
ward or punishment, knowledge of results) occurs (269, p. 481). The 
original statement concerning backward action should then be modifie 


Let us turn first to a consider 
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to read that one type of after-effect (satisfier or annoyer) acts on other 
after-effects of a connection (the persisting physiological equivalent of 
the original connection). 

The same logic for bridging the temporal gap between connection 
and consequence and avoiding the cul-de-sac of retroaction underlies 
Hull's use of the concept of stimulus-trace. Upon cessation of a stimulus 
(exteroceptive or proprioceptive) the afferent impulse continues its 
activity for a finite short period, and a short-range temporal integration 
is achieved. Hull writes, “This perseverative stimulus-trace is biologi- 
cally important because it brings the effector organ en rapport not only 
with environmental events which are occurring at the time but with 
events which occurred in the recent past, a matter frequently critical 
for survival” (127, p. 385). Not only may the physiological correlate ofa 
stimulus or response persevere in time but symbolic processes may help 
the human learner and, to a lesser extent, the animal learner to bridge 
the temporal gap between the occurrence of the connection and the 
action of the satisfier (192). 

There have been other attempts to slip from under the horns of the 
dilemma of retroaction. Accepting the apparent fact of backward action 
at its face value, retroflex circuits in the brain (297), neural irradiation 
(96), changes in the electrical resistance of neural connections (251) and 
their readiness to conduct (252, 267), the rearousal of just-active path- 
ways (44)—all these were suggested as possible mechanims mediating 
the action of satisfiers and annoyers. These physiological explanations 
can be called neither right nor wrong. They are clearly speculative and 
many steps removed from the level at which experimental verification 
is at present possible. 

The most radical solution of the problem has been to deny retroac- 
tion altogether and to affirm that in spite of the appearances to the 
contrary effect always operates in a forward direction. What is modified 
is not the connection which is passed and done with but the stimulus 
on the occasion of its next appearance. ‘The burned child shuns the 
sai ‘aa oa yee his movements, but because, since 
seep the rea ey Ç Er ; F now flame plus fear, no longer 
T Ss wa y ‘ af os ) sepa ih Carr ascribed the fixa- 
a om es imination of responses primar! y to the sensory consequences 

rather than to the strengthening of S-R connections. “These 


Consequences do not influence the portion of the act that preceded 


them... they do affect the subsequent functioning of the act” (29, 
n 96). Whether or not it is possible to cut the Gordian knot of retroac- 
ton by insisting that effects exert their influence in a forward direc- 
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tion, criticisms such as those of Hollingworth and Carr were important 
because they shifted the emphasis from the response to the stimulus. 

In trying to evaluate the arguments for and against retroaction, one 
is struck by the fact that the battle has been fought largely on the plane 
of logic (312). There can be no doubt that rewards and punishments 
following responses modify these responses on the occasion of their 
recurrence. Not enough is known about the basic mechanisms of learn- 
ing to decide whether consequences of a response can or cannot act back 
on a connection or whether perforce they must exert their effect in a 
forward direction. Operationally speaking, the consequences of a re- 
sponse always act in a forward direction since they can only be tested 
on the occasion of a future occurrence of the stimulus situation. Physio- 
logically speaking, when all is said and done we had best admit our ig- 
norance of the action of cortical neurones. Perhaps part of the confusion 
has stemmed from a premature reification and physiologizing of the 
terms involved. The words connection and effect are logical constructs, 
the words stimulus and response are generic terms denoting classes of 
events with certain properties in common (237). It is only too tempting 
to forget the limitations of these concepts and to manipulate abstrac- 
tions as if they were clearly delineated entities with equally unequivocal 
physiological counterparts. For analytical purposes we need such con- 
cepts as stimulus, response, and connection, but are we not over-optimis- 
tic when we assume that the nervous system makes its division along the 
same lines? If misplaced physiological concreteness is avoided one may 
escape logical difficulties such as the dilemma of retroaction. 


Wuat Is THE NATURE OF SATISFIERS? 


Whereas some critics were most concerned with the mechanisms 
mediating effect, others focussed their attention on the nature of the 
eatisfiers and annoyers to which reference is made in Thorndike’s law. 
Although Spencer and Bain, in whose tradition Thorndike continued, 
frankly invoked pleasure and pain as agents responsible for the fixation 
and elimination of responses, Thorndike’s law has been a law of effect, 
not affect (113). He carefully defined satisfiers and annoyers in terms 
independent of subjective experience and report. “By a satisfying state 
of affairs is meant one which the animal does nothing to avoid, often 
doing such things as to attain and preserve it. By a discomforting state 
of affairs is meant one which the animal avoids and abandons” (264, 
p. 245). Although admittedly free of hedonism, such a definition of 
satisfiers and annoyers has faced another serious difficulty: the danger 
of circularity. The critic may easily reword the definition to read: “The 


——-. 
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animal does what it does because it does it, and it does not do what it 
does not do because it does not do it.” This reductio ad absurdum is 
probably not entirely fair but it points up the danger of the definition in 
the absence of an independent determination of the nature of satisfiers 
and annoyers. The satisfying or annoying nature of a state of affairs 
can usually be determined fully only in the course of a learning experi- 
ment and cannot then be invoked as a causal condition of learning with- 
out circularity (108). In their experimental work Thorndike and his 
associates have made no significant attempts to establish the satisfying 
or annoying nature of their rewards and punishments independently of 
the learning experiment. The tacit assumption has been made through- 
out that the words “Right” and “Wrong” and/or small money gains 
and losses are, indeed, satisfying and annoying states of affairs. As 
Tolman and his co-workers have argued on the basis of strong experi- 
mental evidence, such events (including shocks) can equally well be 
Conceived as signals or “emphasizers,”’ providing the subjects with in- 
formation as to the correct response (294). We shall return shortly to the 
Problem of cognitive (informative) vs. satisfying and annoying after- 
effects of a connection. 

Thorndike’s definition of satisfiers and annoyers has been defended 
against the stigma of circularity by emphasizing its empirical adequacy 
(171). “The primary fact is that there is a state of affairs, whatever 
its psychological classification, which happens as a result of, or at least 
after, an act. The state of affairs may be symbolic... or it may be 
Primarily sensory... or it may be complexly perceptual” (171, pp. 
576-577). This statement is fully in line with Thorndike’s own reformu- 
lation of the nature of a satisfying state of affairs. In his more recent 
Writings Thorndike has conceptualized the action of satisfiers in terms 
of the O.K. reaction (273, 277), a fundamental confirming reaction 
brought to bear by the organism on connections between situations and 
acts. He describes the O.K. reaction as the ‘‘unknown reaction of neu- 
Tones which is aroused by the satisfier and which strengthens connec- 
tions on which it impinges” (273). This reaction is independent of 
Sensory pleasure. It is “far from logical.” It strengthens “connections 
which are wrong, irrelevant, and useless.” It is said to bear little rela- 
tion to the intensity of the satisfier. Neither specific motivation nor 
effects relevant toa specific motivation are any longer assumed. 

Stripped of virtually all defining properties and qualifications, the 
law does indeed have a very wide range of applicability but only at the 
€xpense of vagueness. The sum and substance of the argument now is 
that something happens in the organism (nervous system) after an act 
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is performed. The fact that something happens influences further ac- 
tion. This something is, however, so little defined that it has almost no 
predictive efficiency. The O.K. reaction has no measurable properties, 
the conditions for its occurrence are so general as to embrace almost 
every conceivable situation. Hence the operation of the O.K. reaction 
can be inferred only ex post facto, after learning has taken place. But 
here we are impaled again on the horns of the dilemma of circularity. 

Perhaps because the reformulated empirical law of effect had a de- 
gree of generality which made it an unsatisfactory tool for the predic- 
tion of specific facts of learning, several auxiliary principles were 
gradually introduced in the connectionist framework: the principle of 
belongingness, according to which S-R correlations which “hang to- 
gether” are learned more easily than connections lacking this quality; 
the principle of impressiveness which states that vivid stimuli are favored 
in learning; the principle of polarity according to which stimulus-re- 
sponse sequences function most readily in the order in which they were 
practiced; the principle of identifiability stating that the most easily 
identifiable connection is most easily learned; and finally the principle 
of availability which has it that the more available a response the more 
easy it is to connect it to a stimulus (227, 269). It is easy to sce that 
most of these principles are closely akin to the concepts of Gestalt 
psychology. According to Brown and Feder (18), “these conditioning 
factors which for Thorndike are so many ad hoc hypotheses, are logically 
prior to his other laws.” These writers claim that Thorndike’s theory 
of learning could be successfully rewritten in terms of Gestalt psychol- 
ogy. Whether or not this assertion is true,® the use of these auxiliary 
principles serves to point up the inadequacy of the O.K, reaction as a 
universal principle of learning. 

It is very probable that the modifications in Thorndike’s description 
of reinforcement were largely influenced by the results 


of the process 
he years. Most of his experiments 


of his own experimental labors over t 
were concerned with rote verbal learning and the acquisition of simple 
skills (269), and it is the results of these experiments which were inter- 
preted in terms of the confirming reaction and such auxiliary concepts 
as belonging and identifiability. Other advocates of the law of effect, 
especially those whose experimental interests have been more in the 
fields of conditioning and discrimination learning, have tended to insist 


5 Certainly Thorndike would not agree. He professes his “inability to understand” 
Gestalt psychology. He feels suspicious of the concepts of Gestalt psychology “because 
for a time they were offered as a substitute for psycho-vitalism” (267, p. 125). He con- 
cludes that a connectionist theory is “far simpler and more in accord with what the 
neurons are and can do” (p. 131). 


af 
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on a version of the law in which the term effect retained much more of 
its original meaning and had direct reference to the primary and second- 
ary needs of the organism. Let us consider, for example, a recent formal 
statement by Hull of his law of primary reinforcement which, in his own 


words, is distinctly related to the law of effect. The law reads as fol- 
lows: 


Whenever an effector activity occurs in temporal contiguity with the 
afferent impulse, or the perseverative trace of such an impulse, resulting from 
the impact of a stimulus energy upon a receptor, and this conjunction is closely 
associated in time with the diminution in the receptor discharge characteristic 
of a need, there will result an increment to the tendency for that stimulus on 
subsequent occasions to evoke that reaction (127, p. 80). 


Reduction of a need is a critical factor in the reinforcement process, 
and by need Hull means a condition in which any of the commodities 
necessary for the survival of the organism are lacking or deviate seri- 
ously from the optimum (127, p.17). Disturbances in the “optimal con- 
ditions of air, water, food, temperature, intactness of bodily tissue, 
and so forth” give rise to needs and activities of the organism for the 
search of survival. It is the highly important adaptive role of the learn- 
ing process to reinforce those movements which will lead to need reduc- 
tion and thus help the animal to survive. Hull recognizes, of course, 
that much learning takes place in the absence of immediate reduction 
of biological needs. To account for such learning, the principle of second- 
ary reinforcement is invoked. “The power of reinforcement may be 
transmitted to any stimulus situation by the consistent and repeated 
association of such stimulus situation with the primary reinforcement 
which is characteristic of need reduction” (127, p. 97). Thus all reinforce- 
ment depends directly or indirectly on the reduction of ‘“‘viscerogenic”’ 
needs. Since reinforcement is a basic condition of learning, it follows 
that learning is primarily an adaptive process in the interest of species 
Survival. Biological needs provide the foundation on which the complex 
Structure of a lifetime of learning is built. 

Hull's formulation of the process of reinforcement has been echoed 
by many other investigators in the field of learning. In a long series of 
€xperiments (some of which will be discussed in detail below) O. H. 
Mowrer, for example, has attempted to produce experimental evidence 
and theoretical argument for a version of the law of effect which is 
closely parallel to that of Hull in its emphasis on tension reduction. A 
typical statement by Mowrer claims that 
++. learning is dependent not upon the mere association, or temporal contiguity, 
of stimuli (or responses) but rather upon the occurrence of a state of affairs 
which has been variously designated as goal-attainment, problem-solution, 
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pleasure, success, gratification, adjustment, reestablishment of equilibrium, 
motivation-reduction, consummation, reward (190). 


Elsewhere Mowrer makes it unequivocally clear what he means by 
satisfaction. He writes: 

Most writers agree that satisfaction ought to be equivalent to pleasure and 
dissatisfaction equivalent to pain. But, then, what do we mean by “pleasure”? 
One view is that we can equate pleasure to drive or tension-reduction and pain 
to drive or drive-increase. . .. Satisfaction, pleasure, and drive-reduction are 


strictly equivalent (185).° 

Mowrer’s interpretation of effect or a satisfying state of affairs is 
frankly and emphatically hedonistic.’ Ultimately all behavior modifi- 
cation is mediated by pleasure (tension-reduction). The statement has 
a very familiar ring. The history of learning theory has completed a 
cycle, and the views of Spencer, Bain, and Baldwin are with us again 
though transplanted into the conditioning laboratory and accompanied 
by great refinement and sophistication in experimental techniques and 
procedures. The historian of learning will be especially interested to note 
that while Thorndike, the original advocate of the law of effect, has 
frequently dissociated himself from a hedonistic interpretation, there 
has emerged a group of neo-hedonists whose interpretation of the law is 
much closer to its historic roots. They were willing to assume the re- 
sponsibility of hedonism in order to make the law more useful in the 
design and interpretation of experiments. 

To many psychologists hedonism today is no more acceptable than 
urn of the century. A vigorous attack against the preser- 
vation of a hedonistic doctrine in the law of effect has been leveled by 
Allport, who has been especially concerned with the implications of a 
law of effect for the growth and development of personality. A strict 
adherence to a hedonistic law of effect would imply that “personality 
grows through the production of tentative trial-and-error acts, some of 
which are selected by the Great God Pleasure for establishment, and 
some by the Great God Pain for banishment” (2, p. 154). Allport be- 
lieves that the main fallacy of hedonism is the confusion of the by- 
product of a complex process with the process itself. He would agree 
that pleasure (tension-reduction) at times is an indicator of the success- 
ful accomplishment of an act (4) but emp 


it was at the t 


hatically denies that pleasure 


process of reinforcement is found 


6 Similar emphasis on tension-reduction as the basic ; 
his work on the effect of electric 


in the writings of Meunzinger (199) in connection with 
shock on learning. 

7 There is an important way in w 
hedonists. For the hedonists pleasure was a motive, W 


theorists it is tension, or discomfort, which is the motive w 
tension-reduction which leads to learning (192). 


hich such a view differs from that of the classical 
hereas for the modern learning 
hile pleasure is the result 0 
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is the main causative factor in the fixation of responses. He also takes 
issue with the law of effect because it perforce implies a segmental ap- 
proach to the learner, ascribing learning to the satisfaction of this drive 
or that drive with the main emphasis on the reinforcement of responses 
(3). For Allport it is the person who learns, is rewarded, and utilizes 
his success in his future adjustemnt. Allport thus pitted an ego-oriented 
psychology of learning against the neo-hedonism of the law of effect. To 
this controversy we shall return presently. 

In attempting to evaluate the controversy which has raged around 
the definition of satisfiers one is struck by the key importance of the 
hedonistic issue. Certainly hedonism is an immediate ancestor of the 
law, and now that the principle of effect has reached an uneasy ma- 
turity it is clear that it cannot deny its origin without sacrificing much 
of its vigor. When the law is stripped of hedonistic implications, when 
effect is not identified with tension-reduction or pleasure (as by Thorn- 
dike), the law of effect can do no more than claim that the state of 
affairs resulting from a response in some way influences future responses. 
Such a statement is a truism and hardly lends itself to the rigorous de- 
duction of hypotheses and experimental tests. If a neo-hedonistic 
Position is frankly assumed (as, e.g., by Mowrer) the law becomes an 
important tool for research, provided ‘‘satisfaction”’ is independently 
defined and not merely inferred from the fact that learning has occurred. 
The hedonistic position can be defended for the range of events for which 
It is experimentally demonstrable. It is probably fair to say that up to 
the present this range is fairly narrow and largely in the domain of 
animal learning, 

The main source of difficulty has been the universality claimed for 
tension-reduction as a principle of reinforcement. Hull and Mowrer, 
On the one hand, and Allport, on the other, probably would agree that 
Teduction of the food need is a critical factor in a rat’s learning the true 
Path through a maze. With the aid of the principle of secondary (and 

igher order) reinforcement, tension-reduction can then be made the 
ultimate condition of all learning. Such a generalization Allport rejects. 
-Tucial experimental tests to arbitrate between these two interpreta- 
tions of more complex forms of learning still seem to be lacking. For 
the time being, therefore, acceptance or rejection of the hedonistic 
Postulate must remain a matter of philosophical preference. 


THE NATURE OF PUNISHMENT 
Thorndike’s View of the Effect of Punishment 


The preceding discussion has dealt primarily with the nature and 
action of satisfaction (reward). As originally stated, the law of effect 
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was a two-pronged statement, putting the stamping-out powers of 
punishment on a par with the stamping-in effects of reward. In the 
light of experimental results Thorndike no longer considers punishment 
as an effective agent for the elimination of wrong responses. Indeed, 
again as a result of experimental evidence, a punished connection is now 
held to be strengthened more by sheer occurrence than it is weakened 
by punishment. Punishment is not the dynamic opposite of reward; it 
is not even capable of overcoming the effect of sheer occurrence (269, 
279). This view is, of course, an implicit resurrection of the law of exer- 
cise since sheer occurrence (frequency) is now credited with a certain 
degree of effectiveness in fixating responses, an effectiveness, moreover, 


which surpasses that of punishment. 
An impressive number of experimental investigations have been 


conducted by Thorndike and his associates to demonstrate the ineffec- 
tiveness of punishment (162, 163, 165, 166, 223, 268, 269, 276, 299). In 
most of these experiments the punishment consisted of the announce- 
ment Wrong made by the experimenter, accompanied sometimes by a 
small fine or an electric shock. The effect of the punishment was assessed 
in terms of the number of repetitions of the punished response as com- 
pared to the number of repetitions that would be expected on the basis 
of chance, i.e., if the subject were guessing at random. The results of the 
experiments led Thorndike to conclude that punishment, instead of 
weakening or “‘stamping-out” the wrong response, may have a variety 
of effects depending on the specific nature of the annoyer and the propen- 
sities of the organism. The important point is that what an animal is 
led directly to do by an annoyer may or may not make the repetition of 
the punished response less likely (269). If punishment does lead to 
the elimination of a response, its action is indirect: it leads to variability 
of behavior, thus increasing the opportunities for the occurrence of 
the correct response which is then reinforced by the direct action of the 
satisher (OK reaction). 

Thorndike’s revised view of punishm 
public. Not only did it contradict the belie i Í I value o 
punishment which had become almost an axiom 1n our social life, it 
was also clearly at variance with a considerable body of other experi- 
mental data. We shall now consider (1) the criticisms leveled against 
Thorndike’s work on the effect of punishment, and (2) other theoretical 
and experimental developments bearing on the role of punishment 12 


ent startled the psychological 
f in the practical value of 


learning. 


Criticisms of Thorndike’s View of The Effect of Punishment 


One serious criticism of Thorndike’s data was statistical in nature. 
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The effect of punishment was measured in terms of the deviation of the 
obtained number of repetitions from the number of repetitions to be 
expected by chance. Thus, if there were five possible responses to an 
item, Thorndike would consider the chance expectation of each re- 
sponse to be .20, and the obtained percentages were expressed as devia- 
tions from .20. Thorndike was thus applying the principle of indiffer- 
ence which considers alternative events equally likely in the absence of 
information to the contrary. The trouble is that there seems to be some 
information to the contrary, viz., that the ‘natural choice frequency 
is usually not a chance one” (123). Whether or not Thorndike’s correc- 
tion for chance is adequate is an empirical question. It is necessary to 
compare the extent of actual repetition without after-effect with the 
extent of repetition following announcement of Wrong. Experimenters 
who used an empirical rather than an a priori baseline for the evaluation 
of the effects of Wrong find that punishment does have a weakening 
influence which is commensurate with the strengthening influence of 
reward (254, 255,283). Thorndike and his associatesfail to find such an 
effect even when favoritism of response is taken into account (162, 166). 

Failing to find a significant weakening of connections following the 
announcement of Wrong, Thorndike denied the efficacy of punishment 
in general.’ Yet even within the restricted framework of a typical Thorn- 
dike experiment, and with as highly specific a “punishment” as the 
announcement of Wrong, the results vary considerably with the param- 
eters of the experimental situation. The medium by which the punish- 
ment is conveyed is important: it may make a difference whether 
Wrong is announced by spoken word or by, say, a signal light. The effect 
of the medium, the sheer “something happening”’ after a response may 
be powerful enough to obscure the weakening influence of punishment. 
Such, at least, was the conclusion of Stephens who argued that whenever 
Punishment strengthens a bond, such strengthening may be due to the 
Physical medium, whereas reward and punishment have directly oppo- . 
site effects when measured from the baseline of informationless after- 
effects (a flash of light, a nonsense syllable) (255).° Jones, (139) on 


8 While the majority of experiments were done with human subjects, Thorndike illus- 
trated the superiority of reward also with animal subjects (268). 

9 Thorndike and his associates also found that informationless after-effects such as a 
Click led to more frequent repetitions than the announcement of Wrong. They were 
Unwilling to conclude, however, that punishment had weakened a response and argued 
that a “neutral” after-effect was ambiguous enough to lead to self-administered reward 
Since the subjects were free to interpret the neutral signal us a reward (165, 166) Such 
an argument comes dangerously close to question-begging. It indicates an unwillingness 
Even to consider the possibility that punishment may have a weakening effect on a con- 
nection. 
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the other hand, found that the medium by which punishment is con- 
veyed makes little difference and that the elimination of errors depends 
on the position of the punished response in the pattern to be learned. 

Whatever the effect of the medium by which the punishment is 
conveyed, the initial strength of the association will be an important 
determinant of the effect. A strong association can be considerably 
weakened, a weak association which is close to zero cannot be inter- 
fered with much further. Stephens was able to show experimentally 
that strong connections (responses which appeared early and were 
persistent) are markedly and dependably weakened by the announce- 
ment of Wrong whereas initially weak ones are not. Correspondingly, 
Right has little, if any, influence on initially strong associations but con- 
siderably aids in making weak bonds strong (254, 256). Tilton (284, 
285) also emphasizes the importance of taking the initial strength of a 
response into account. 

There is another feature of the experimental situation which Thorn- 
dike fails to take into account but which may be, at least in part, re- 
sponsible for his failure to find a direct effect of punishment. In most 
of the experiments, there was only one right response and several wrong 
alternatives. Thus it was the subject's task to learn positively only 
one item, but to eliminate several items. By virtue of its uniqueness, 
the right response is “figural” (has, as it were, high visibility for the 
subject), whereas a wrong response is one in a long homogeneous series 
of wrong responses (213). These considerations receive experimental 
support from the work of Dand (45) who equated the number of right 
and wrong alternatives and presently found an announcement of Wrong 
to have a definite weakening effect. Lorge (162) reports that the potency 
of a punishment is less the higher the initial probability of obtaining 
a right response (though not reliably so). It is safe to say that the pat- 
tern and sequence of Rights and Wrongs are at least partial determinants 
of the fixation and elimination of responses. 

The complex ways in which the effects of Wrong vary with the param- 
eters of the experimental situation highlight the need for caution in 
generalizing about the effects of punishment. Even when the punish- 
ment consists of a simple announcement of Wrong, the experimental 
evidence is still inconclusive. An announcement of Wrong is only a very 
special type of punishment situation. Indeed, one may question the 
appropriateness of the term punishment or annoyer for the description 
of an effect which is primarily informative in nature. In most of Thorn- 
dike’s situations, the connections to be learned are purely arbitrary and 
it is easily obvious to the subject that being right or wrong cannot poss!- 
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bly reflect on his intelligence. He may be neither gratified by being 
right nor annoyed by being wrong. Wrong may merely serve as a signal 
to substitute one arbitrary guess for another. We must reiterate the 
need to define satisfaction and annoyance independently before in- 
voking them as determinants of the learning process. 


Other Studies of the Role of Punishment 


When we turn to punishments other than an announcement of 
Wrong or slight monetary losses, the uncertainty of Thorndike’s gen- 
eralization becomes even more apparent. An impressive array of ex- 
perimental evidence can be marshalled in support of the proposition 
that punishment is an effective condition of learning. We shall first 
review the empirical findings and shall then consider the main theoreti- 
cal issues concerning the nature of punishment and the mechanisms 
of its action. 

Society has devised many and varied punishments for those who 
offend against law and convention but in the psychological laboratory 
punishment has come to be almost synonymous with electric shock. 
Shock is easily administered and it is always sure to have a physiological 
impact though its psychological effects are highly variable (296). Thus 
most of the data on punishment come from studies on electric shock. 

A review of the literature on punishment raises the following prob- 
lems: 

1. Does punishment have any significant effect on learning? 

2. Is the effect of punishment generalized to the entire learning situation or 
does it lead only to the elimination of specific punished responses? 

3. What is the mechanism by which punishment exerts its effects? Does 
Punishment weaken stimulus-response bonds? Does it affect the general level 
at which the organism functions? Or is its effect indirect, leading to increased 
Variability of behavior and ultimately to differential reinforcement by reward? 

4. Does the main effectiveness of punishment reside in the information 
about errors with which it provides the learner or does it have a more im- 
Mediate influence on associative strength? 


Does punishment have any significant effect on learning? We have al- 
ready seen that Thorndike answers this question in the negative but 
that the validity of this conclusion from his data is doubtful. The 
answer to this question is, trite as it may sound, yes and no. It is rather 
unfortunate that Thorndike and his associates have tended to put the 
issue on an all-or-none basis. The question should not be, “‘does punish- 
ment have an effect on learning or does it not?” but rather, ‘‘under 
what conditions is it effective and under what conditions does it fail to 
show results?” m 

Against the blanket assertion that punishment is not instrumental 
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in the elimination of wrong responses it is possible to cite a long list of 
papers covering more than half a century of experimental -work which 
report that punishment is an effective condition of learning. 

Without attempting to review these experiments in detail, we shall 
summarize the main findings. 

1. A large number of different organisms have been spurred on to better 
learning by punishment:!° earthworms (325), cockroaches (261, 300), dancing 
mice (324), rats (e.g., 20, 54, 56, 57, 111, 116, 155, 193, 196, 199, 200, 201, 202, 
204, 206, 301, 307, 322), chicks (36), cats (53), and, last but not least, human 
beings (10, 12, 13, 19, 24, 25, 26, 27, 41, 55, 79, 80, 87, 137, 138, 172, 173, 217, 
218, 302). f 

2. Punishment, usually by electric shock, has improved learning in a variety 
of problem situations: discrimination boxes, mazes, reaction-time procedures, 
serial learning, mirror tracing, multiple choice situations. 

3. Punishment has been found to affect different measures of learning. In 
many studies, in which responses are scored as either right or wrong, punish- 
ment has led to a decrease in the nu mber of errors (10, 13, 24, 25, 26, 27, 36, 41, 
54, 55, 56, 57, 79, 80, 87, 111, 116, 137, 138, 172, 173, 193, 196, 199, 200, 201, 
202, 204, 206, 218, 301, 302, 307, 326). When speed of learning is measured in 
terms of the number of trials required to reach criterion, we find that punish- 
ment often results in a smaller number of trials (19, 24, 25, 26, 27, 36, 41, 54, 55, 
111, 137, 155, 171, 193, 196, 199, 201, 202, 204, 206). Some investigators report 
the fact that punishment reduces the time required for learning (24, 25, 41, 55, 
79, 80, 87, 206, 302) but time has proved to be a difficult measure to interpret 
since punishment typically leads to a cautious hesitant attitude on the part of 
the learner so that a decrease in the number of trials and errors may be accom- 
panied by an increase in time per trial (10, 24, 25, 26, 172, 173, 206, 218). 

4, The effectiveness of punishment may also be demonstrated by retention 
tests. There are experimental reports suggesting that (1) punished groups 
remember the learned task better (27, 41, 56, 79, 80, 81, 206, 301), (2) punished 
groups are less subject to retroactive inhibition (27). Different measures 0 
retention show the effect of punishment to unequal degrees, however (41, 81, 


301). 


On the basis of the experim 
at least that punishment wor 
But note the qualifier some. N 


ental evidence it is thus possible to assert 
ks in some situations some of the time. 
one of the papers cited would justify the 
conclusion that punishment works in all situations all of the time. On 
the contrary, the experimental evidence emphasizes the extent to which 
the effectiveness or non-effectiveness of punishment depends on the pa- 
rameters of the experimental situation. 


1. The intensity of the punishment is i 


(326) pointed out more than forty years ago, inves o i 
confirmed, there is in a given learning situation an optimal intensity of punish- 


ment. The relationship between intensity of punishment and learning efficiency 
is not linear over the total range of intensities. Once the optimal intensity has 


mportant. As Yerkes and Dodson 
and many investigators since have 


10 Conditioning studies employing a punishing UcS are not included here. 
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been exceeded, efficiency of learning may decrease (36, 53, 87, 302, 326, 328). 
There is moreover, no one optimal intensity for any given organism. The 
optimal intensity will vary significantly with (1) the difficulty of the task and 
(2) the stage of learning at which punishment is applied. = 

2. There has been rather general agreement that relatively severe punish- 
ment (intensive shock) is most effective in the learning of simple habits such as 
black-white discrimination and simple maze patterns, and that relatively mild 
Punishment is optimal in the case of difficult tasks such as more complex types 
of discrimination and exacting maze patterns (12, 26, 36, 53, 326, 328). 

3. The effects of shock may become disruptive (12, 79, 80, 87, 294, 303), 
especially if applied early in the learning process (25, 206, 218). Shock may also 
lead to increased variability of responses as compared with milder forms of 
Punishment (260). ¥ 

4. As to the stage of learning at which punishment is most profitably in- 
troduced, we have already referred to Stephens’ general conclusion that the 
greatest effect is achieved with strong connections (254, 256). This conclusion 
is borne out by the findings of Valentine (301) and by a systematic investiga- 
tion of Bunch (25) who got better results with a few shocks late in the learning 
than with many more administered earlier in the process. The results of his 
experiments led Bunch to believe that for a given task there is an optimal com- 
bination of position in learning and number of trials with shock. This generaliza- 


tion clearly reflects the dependence of the effect of punishment on the experi- 
mental parameters. 


Any generalization about punishment based on a single experiment, 
or even a series of experiments, thus becomes extremely hazardous. For 
a given organism, there are combinations of conditions under which 
Punishment is effective. If the experimenter works outside this range of 
Combinations, punishment fails to yield significant results (Thorndike 
and co-workers). If he works with several combinations of conditions, 

e may easily find that punishment is effective in one case but not in the 
other and certainly that it varies in effectiveness (12, 26, 87). If he 
Works with only one set of parameters, he may be tempted to over- 
generalize. Such seems to have been the case in Thorndike’s experi- 
ments on the effect of Wrong since most of the connections subjected to 
Punishment were probably rather weak (254), and the punishment 

ardly very intensive in nature. Such lack of consistency in results need 
Not lead to a counsel of despair regarding the possibility of generaliza- 
tion about punishment, but merely to an insistence that any such gener- 
alization should be based on functional relations covering a wide range 
of parameters and not on isolated, restricted experiments. 

A special problem of control needs to be mentioned here. In an 
attempt to hold as many of the experimental conditions as possible 

constant,” the experimenter may decide on a certain type and intensity 
of punishment and then adhere to it throughout the course of the experi- 
ment, He would then overlook the fact that the effectiveness“of a 
Punishment changes as a function of the number of times that it has 
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been administered. In the case of shock, there are clear-cut experimental 
data to substantiate this point. Investigating electric shock as a moti- 
vating stimulus in conditioning experiments, Kellogg (143) found that 
in order to maintain a flexion reflex of fixed extent it is necessary to ad- 
minister relatively large voltages at the beginning of the series and to 
reduce the stimulus intensity toward the end of the scries. He also 
found that the intensity required varied with the distribution of prac- 
tice. In addition, Kellogg reported wide individual differences among 
his subjects (dogs) in sensitivity and emotional reaction to the same 
physical shock. He concluded: 


It seems reasonable to infer that in learning experiments where the physical 
properties of an electrical stimulus are constant, the effects produced in the 
same subject at different times must differ greatly. The attempt to maintain 
“constant motivation” by this method probably defeats its own purpose (143, 


p. 95). 


Considerable individual differences in reactions to shock have also been 
reported for human subjects (296). Too few experimenters have heeded 
Kellogg’s admonition to consider the effectiveness of a shock at a given 
point in time rather than its physical intensity. 

Is the effect of punishment general or specific? In the framework of 
connectionism, it is always a specific bond which is strengthened or 
weakened. Thus Thorndike and his co-workers have always concerned 
themselves with the history of specific rewarded and punished responses, 
finding punishment ineffective. Other investigators have found, how- 
ever, that punishment may have a much more general effect: there may 
be a general improvement in performance, with unpunished responses 
benefiting along with the specifically punished ones (10, 12). Even if 
the punishment itself is made non-specific (‘‘non-informative shock”), 
and not administered as a consequence of any particular response, there 
is still a facilitating effect (11, 12, 19, 25, 79, 80). The work of Muen- 
zinger is of especial importance here. In a series of carefully controlled 
experiments he was able to show that (1) shock administered to rats 
after the choice-point in a maze facilitates learning and is more effective 
than hunger-motivation alone (193, 196); (2) shock before the choice- 
point decreases the efficiency of learning" (204) as does shock at the 
moment of choice (71); and (3) if shock is administered throughout the 
maze, its effect is only slightly less than with punishment after the 
choice point (199). These findings apply to shock for right responses as 
well as shock for wrong responses. When both right and wrong re- 
sponses are shocked, there is no summation of effects though learning 
efficiency is greater than if either right or wrong choices alone are fol- 


lowed by shock (204). 
u This retardation is probably due to the persistence of position habits which become 
fixated (70, 101). 


ass Ne 
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The question asked at the beginning of this section, whether the 
effect of punishment is general or specific, turns out to bean unwise one. 
Clearly, the effect may be general, specific, or both, depending on the 
way the experiment is arranged and the measurements are made. Ina 
conditioning experiment, for example, in which punishment is employed 
for the purpose of establishing a discrimination, the effect is of necessity 
highly specific. A response to, say, one tonal frequency is reinforced, 
and the responses to other frequencies are weakened or eliminated. It 
would be obviously nonsensical to expect unpunished responses to be 
affected in the same way as punished ones. At the other extreme of the 
continuum of specificity-generality is a situation in which punishment 
is administered without reference to any particular response, say, after 
a block of trials. In that case, the only possible effect that punishment 
can have is general, inducing an attitude of caution, serving as an in- 
centive, etc. The general conclusion on the generality-specificity issue 
is that punishment has both general and specific effects, sometimes one, 
sometimes the other, sometimes both, depending on its role in the total 
experimental situation. 

What is the mechanism by which punishment exerts its influence? There 
has been considerable speculation about the events in the organism 
which mediate the effect of punishment. This is, of course, an example 
of the search for intervening variables. Punishment is the independent 
Variable, the effects of punishment (positive and negative) are the de- 
Pendent variables. In terms of what hypothetical events can the re- 
lationship between these two variables be conceptualized? The answers 
which investigators have suggested fall into three broad classes: (1) 
Mechanisms of action which are held to be specific to punishment, (2) ex- 
Planations which reduce whatever effect punishment may have to the 
ultimate action of rewards, and (3) explanations which ascribe the ef- 
fectiveness of punishment to the information or perceptual emphasis 


Which it provides. 


1. Mechanisms specific to punishment. Throughout his writings on punish- 
Ment Thorndike has referred to the weakening or stamping out of stimulus- 
Tesponse bonds. Such weakening effects as he has found were ascribed to 
Processes detrimental to the conductivity of the neurones. For the last fifteen 
years, however, he has argued against significant effects of punishment and 
therefore has not further concerned himself with a mechanism to account for 
the weakening of specific S-R bonds by punishment, putting the entire burden 
of explaining learning on the action of rewards. , ; 

An important suggestion as to the mechanism by which punishment affects 
the emission of specific responses comes from students of operant conditioning 

68, 237). Their experiments deal with the effects of negative reinforcement 
Punishment) on the lever pressing behavior of rats. The general conclusion is 

at punishment results in a suppression rather than weakening of a response. 
Administration of punishment temporarily depresses the rate at which responses 
re emitted, but the suppressed behavior will be released when circumstances 
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are more favorable. In terms of Skinner’s concept of reserve, punishment 
affects the relation between reserve and rate of emission of responses but has 
no effect upon the reserve itself. The effect of punishment is therefore emo- 
tional, leading to a temporary depression of function but not to a permanent 
weakening or elimination of responses. These findings not only serve to em- 
phasize the differential effects of reward and punishment on the emission of 
responses but also convey an important methodological warning: the strength 
of a response at the end of a period of punishment cannot be used as a reliable 
index of the true long-range strength of the response (68). Punishment is not a 
mechanical stamping-out process. Its effect must be considered in relation to 
the experimental situation. 

The emotional effect of punishment is also emphasized in Guthrie’s attempt 
to account for the elimination of wrong responses (90). For Guthrie, the funda- 
mental condition of all learning is conditioning by contiguity in time. It is 
movements, not acts, which are conditioned to stimulus cues. The more aroused 
the organism, i.e. the greater the emotional excitement, the more varied and 
intense the movements which occur. These varied and intense movements pro- 
duce a correspondingly high degree of proprioceptive stimulation. Propriocep- 
tive stimuli are important cues to which responses are conditioned. Hence 
punishment facilitates conditioning (of withdrawal responses, for example) not 
because punishment is annoying but because punishment arouses the organism 
and provides a multiplicity of cues for the establishment of conditioned re- 
sponses. Guthrie insists that the action of punishment cannot be in any way 
clarified by referring to the action of annoyers. Suppose we define annoyance as 
a state of affairs which the animal avoids or changes. “But this ability to avoid 
is just what is necessary to explain unless we assume that learning has already 
taken place. If annoyance means avoidance, we had to learn to be annoyed at 
annoyers (by emotional reinforcement). It is what the punishment makes an 
organism do that counts not what it makes him feel!” (90, p. 14). 

From what may be called stimulus and/or response explanations of punish- 
ment we turn now to explanations which put the main emphasis on the changes 
wrought by punishment in the subject’s approach to the learning situation an 
his general manner of performance. We are dealing here with qualitative de- 
scriptions of performance under punishment. Several investigators have 
stressed the attitude of caution which the subject assumes when he is punishe 
for incorrect responses. The learner is circumspect, he steps warily (10, 24, 25, 

26, 172, 173, 218). Muenzinger has found that a punishment or obstacle makes 
the subject pause before he proceeds with his response (196, 200, 202, 203). 
During this pause he is exposed to relevant stimulus cues. In this regard shock 
and an enforced delay (200, 203) or an obstacle such as a gap (202) after the 
choice-point are equivalent. According to Honzik and Tolman (116) a state 0 
heightened vigilance may make this increased exposure to stimulus cues espe 
cially effective. Thus punishment is described as making the organism more 
sensitive and vigilant, and as enhancing the animal’s opportunity to learn about 
the situation and indulge in successful vicarious trial-and-error. Similar qualita- 


tive observations describe punishment as raising the general incentive leve 
(57, 79, 80). p j S 

2, Explanation of effective punishment in terms of reward. When Thorndike 
jettisoned punishment as a condition of learning, he left open the possibility 
that it may have an indirect effectiveness by speeding up the occurrence of acts 
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which are rewarded. Thus punishment may be a condition for the eventual ap- 
plication of reward and in this way a determinant of learning, though one step 
removed from the locus of the critical event. Hull's theory of primary reinforce- 
ment which makes all learning contingent on need-reduction clearly implies 
such an approach to punishment. The reduction of the effect of punishment to 
the action of reward has been made explicit in the writings of Mowrer (181, 183, 
184, 185, 186, 188). According to the conception which Mowrer presents, all 
motives involve tension or discomfort. Thus there is discomfort or tension due 
to hunger just as much as there is discomfort or tension due to shock. In this 
sense, all learning, dependent as it is on motivation, involves punishment, the 
Punishment inherent in the motivating tension. Given the motivation (tension), 
earning occurs when the tension is reduced. Tension may be reduced either 
through the occurrence of a consummatory response (e.g., reduction of hunger 
by feeding) or through escape from punishment (e.g., escape from shock). In 
owrer’s own words, “ . . . it is therefore meaningless to say that one type of 
earning is ‘through reward’ and another type is ‘through punishment’; each is 
an essential aspect of a single dynamic process” (184, p. 424). Similarly 
uenzinger (199) argues that it is “poor logic and bad science” to compare 
learning by reward and learning by punishment. Without a primary state of 
Imbalance and its subsequent reduction there can be no learning. Punishment 
1s merely a way of increasing this imbalance or tension in order to make the 
subsequent need reduction more effective as a reinforcing state of affairs. 

Such a conceptualization assigns to punishment a much more important 
Systematic role in learning than does Thorndike’s. For Thorndike, the effects 
of punishment are variable and unpredictable. Punishment has no vital place 
in his picture of the learning process. In the view represented by Mowrer and 

Uenzinger, on the other hand, punishment is at the very fountainhead of 
¢arning. For punishment induces tension, and without tension the behavior 
Which leads to learning by reinforcement would not occur. Thus the law of 
effect again becomesa law of reward and punishment. It has, however, undergone 
a crucial transformation. Formerly either reward or punishment was believed 
© be a determinant of learning, and punishment was, as it were, reward with 
à negative sign attached to it. The modern version is a law of reward after 
Punishment, for punishment provides the stimulus which makes a reward a 
reward, Both reward and punishment are integral parts of the learning situa- 
10n, provided they occur in the proper sequence and are functionally connected 
With each other. This analysis of the role of punishment, which Mowrer has 
fended on the basis of his experimental work, has all the beauty and all the 
aint of simplicity. Here is an integrated, coherent account of learning, 
= aa the basic assumption that leariinite camm, vovi without initial tension and 
wii a tension-reduction. It is precisely the granting of this assumption 
Thus bas one of the controversies with which modern learning theory abounds. 
hr EA we have come in our discussion to the same parting of the ways to which 
whet Scussion of reward has led us. In which direction progress lies depends on 
‘ss er or not tension and tension-reduction can stand up as necessary (though 

Sufficient) conditions of learning. 
en Punishment or emphasis? In general, an individual is subjected to punish- 

roc aniy when he has committed a wrong. | In learning experiments, the same 
Si edure has usually been followed. The subject is punished when he commits an 
or. Sometimes it follows unequivocally from the nature of the situation what 
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constitutes an error (e.g., blind alley in a maze), at other times the experimenter 
decides arbitrarily which responses are to be considered right and which ones 
wrong (e.g., in learning word-number combinations in a Thorndikian experi- 
ment). Thus errors and punishment have been associated in investigations of 
the law of effect until Tolman, Hall and Bretnall (294) decided to explore the 
effect of punishment for right as well as for wrong responses. If punishment 
does, indeed, weaken associative connections, then punishment for correct 
responses should slow up the learning process, whereas punishment for wrong 
responses should result in faster learning. These expectations were not con- 
firmed. Tolman, Hall, and Bretnall found that (1) adding a shock to an audi- 
tory signal for right responses did not slow up learning significantly and (2) 
adding a shock to an auditory signal for wrong responses did not speed up 
learning but on the contrary slowed it down. Reinforcement of right responses 
was always more effective than reinforcement of wrong responses. These find- 
ings were so strikingly at variance with what the traditional law of effect would 
have predicted that the article reporting these findings was entitled “A disproof 
of the law of effect.”"” 

Tolman, Hall, and Bretnall believed that the reinforcing stimuli—both 
auditory signal and shock—do not stamp in or stamp out responses but rather 
serve to emphasize (perceptually or cognitively) the reinforced responses. 
Moreover, an emphasis upon correct responses favors learning whereas emphasis 
on wrong responses does not. Any relatively violent emotional stimulus may 
in addition have disruptive effects which will counteract the favorable influence 
Other experimenters took up the idea of administering punish- 
ment for right as well as wrong responses. The work of Muenzinger has already 
been referred to. In the first of his series of experiments he found that punish- 
ment for right responses and punishment for wrong responses were equally 
effective (193). In a subsequent repetition of his experiment he found shock for 
wrong responses inferior (196). In one experimental variation, shock was ad- 
ministered for correct choice, incorrect choice, and at the time of feeding. All 
three conditions accelerate learning significantly (57). As in other studies of 
punishment the intensity of the shock relative to the task is a crucial param- 


eter (87). 

The experiment of Tolman, Hall, and Bretnall was of considerable 
systematic significance because it shifted attention from punishment as 
an automatic mechanism to punishment as a perceptual event. The 
learning task may be regarded as a problem which the learner tries to 
solve. Anything which helps the learner to distinguish correct from in- 
correct responses facilitates learning. A punishment may merely serve 
to designate a given response as either right or wrong at the pleasure of 


the experimenter. 


of emphasis. 


INFORMATION VERSUS EFFECT 


The efficacy of punishment for right as well as wrong responses is 
closely linked to a general problem which has been repeatedly raised in 


12 Goodenough (84) criticized Tolman's findings on statistical grounds but later ex- 
periments have tended to confirm Tolman’s results (119, 214, 234). 
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discussions of the law of effect. Can the efficacy of rewards and punish: 
ments be explained in terms of the information which they convey to 
the subject about the correctness or incorrectness of responses? If the 
effect of reinforcement were always mediated by information, it would 
not be possible to speak of a direct effect of satisfaction (need-reduction) 
or annoyance on stimulus-response connections. Rather, the hypo- 
thetical sequence of events in the organism would then become: 
stimulus-response-reward or punishment-information-strengthening (or 
Weakening) of the association. In such a description, the mechanism 
by which information leads to the strengthening of associative connec- 
tions would be left open. One could, like Tolman, think of the formation 
of sign-gestalt expectations (288, 291) or assert that information leads 
to tension-reduction and that learning through information is only a 
Special case of the law of effect. The problem of effect versus informa- 
tion has not always been posed in all-or-none terms. Granting that both 
factors may be operative, investigators have been interested in gauging 
their relative importance. $ 

Information is a concept which does not easily lend itself to experi- 
mental ayalysis. There is always the vexing possibility that what the 
experimenter considers to be mere information may act as reward, and 
conversely that what the experimenter believes to be a reward (need 
reduction) may serve as a source of information. There may, moreover, 
always be self-administered information (229). Nevertheless, experi- 
Mental situations have been designed which at least made it possible 
to speculate about the role of information in the learning process, These 
experiments have been concerned with (1) clearly non-informative after- 
effects, (2) the question whether a learner need necessarily be aware of 
What he is learning, (3) the relevance of after-effects to the connections 


Which they strengthen. 


Non-informative after-effects. In our discussion of punishment ex- 
Periments were cited which show that punishment administered without 
Teference to a specific response facilitates learning. Similarly, rewards 
Siven after a block of trials and not following a particular correct re- 
SPonse have been found effective (60, 281). In general, however, effects 
Which give specific information have been found superior to non-inform- 
ative ones (60, 224, 298). 

The study of “neutral” after-effects has a direct bearing on the prob- 
lem of effect versus information. An after-effect is “neutral if it neither 
rewards, punishes, nor informs. Nonsense words, meaningful words, 

ashes of light, and clicks have been used as after-effects which merely 
appen after an S-R connection, presumably without telling the subject 
anything about the correctness of his response. Several investigators 
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have found that such after-effects strengthen responses to some extent 
(165, 166, 255). The very ambiguity of such after-effects has made it 
possible to interpret them in different ways. Thorndike and his co- 
workers believe that ambiguous after-effects may be interpreted as re- 
warding by the subject and hence function as rewards (165, 166). 
Stephens (255), on the other hand, suggested that the mere fact that 
something happens after a response, that the connection is “attended to” 
facilitates learning. Something happening after a response may, however, 
be interpreted by the subject as conveying information. “Neutral” 
after-effects have repeatedly failed to influence learning when the factor 
of information was carefully controlled (37, 38, 298). 

It seems that the use of neutral after-effects is a highly unsatisfac- 
tory procedure. The fact that an experimenter considers an after-effect 
neutral does not, of course, guarantee that the subject interprets the 
event in the same way. It seems puzzling that some experimenters first 
call an effect neutral, then turn the tables on themselves and assert that 
it was not neutral after all but rewarding. If it is neutral after-effects 
that we want to study, we should make sure that the effects are indeed 
neutral and then consider them neutral throughout. On the other hand, 
if we are interested in the effect of rewards, the study of ambiguously 
neutral after-effects provides a very devious avenue of approach to the 
problem. 

Awareness of what is being learned. If learning by reward can take 
place while the learner is not aware of what he is learning, then certainly 
effect and information could not be considered coextensive. Thorndike 
repeatedly claimed that rewards are effective even if there is little or 
no opportunity for inner rehearsal of the right responses (266, 269, 275), 
though he has been contradicted on that point (37, 298). Thorndike 
and Rock (282) then offered what they consider conclusive proof that 
learning without awareness of what is being learned can, indeed, take 
place.!? In their experiment, learning depended on the discovery of @ 
principle by the subjects. Yet there was only gradual improvement 
under reward and no sudden increase in successful responses. It seemed 
that subjects achieved insight without knowing that they did. The 
interpretation of this experiment depends entirely on the assumption 
that gradual improvement indicates lack of awareness. It turned out, 
however, that even subjects who are explicitly taught the principle on 
which their learning depends still may show gradual improvement (131). 
The point here is that understanding a principle and using it are two 
different things. Slow improvement may reflect lack of insight but also 
may be due to the learner's inability to translate his understanding into 
smooth, efficient action. This attempt to prove learning without aware- 
ness must be considered doubtful. On the other hand, positive evidence 


13 For other experiments demonstrating that subjects can learn without being aware 
of what they learn, see Thorndike (269, 276). ‘ 
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has been offered for the importance of awareness (104) and knowledge 
of results (7, 82, 140, 225) for successful learning. Whether or not learn- 
ing without awareness of what is being learned can take place must re- 
main an open issue. 

The role of relevance. A satisfying after-effect, according to Thorn- 
dike, does not act logically but in a mechanical way. Itis likened to a 
natural force applied to do work and need not, therefore, be relevant to 
the activity of the organism at the time at which the reward is given. 

Uppose a subject is engaged in activity directed toward goal A. In the 
Course of his trial-and-error behavior he is given a reward which leads 
Not to goal A but to an irrelevant goal B. Nevertheless the response 
which led to the reward is reinforced. Such findings have been reported 
for both human and animal subjects (164, 271, 274, 276), casting doubt 
on the theory that rewards function primarily as sources of information 
relevant to the learning task. How conclusive are such experiments on 
the role of relevance? Relevance and irrelevance are terms which refer 
to the experimenter’s interpretation of the situation. The subject's in- 
terpretation and the wants under which he operates are an altogether 
different matter. The only conclusion which can be drawn from such 
€xperiments is that responses which occur in temporal proximity toa 
reinforcing state of affairs are strengthened (Hull). Relevance is a 
Normative term reflecting the experimenter’s judgment. , 

Even if we agree that relevance is a useful dimension for the descrip- 
tion of reward, the evidence on the role of relevance remains inconclu- 
sive, There are strong indications that, at least in some experimental 
situations, rewards irrelevant to the subject’s motivation fail to facili- 
tate learning. The work of Wallach and Henle (305, 306) is a case in 
Point. Ina typical Thorndike situation (learning word-number combi- 
nations) the subjects were informed that they were participating in an 
Xperiment on extra-sensory perception and that responses called right 
On a given trial might or might not prove correct on subsequent oc- 
©sions. These instructions rendered the rewards irrelevant to the sub- 
Ject’s task—guessing numbers by ESP. Asa result, rewarded responses 
Were not repeated more frequently than punished ones, nor did these 
€xPerimenters obtain a level of repetition of wrong responses at all com- 
Parable to that reported by Thorndike. Wallach and Henle conclude 

at “with the learning motive eliminated no automatic effect of reward 

Sems demonstrable.” Their argument is further strengthened by the 
act that a change in instructions, telling the subject that responses 
Would no longer vary in a random fashion caused a highly significant 
Mcrease in the number of rewarded responses that were repeated. 

he results reported by Wallach and Henle strongly suggest that 
rewards may not act as illogically and blindly as Thorndike’s findings 
indicated. The subject’s attitude toward the learning situation néeds 

€ taken into account. In the absence of explicit instructions not to 
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learn, subjects will by and large instruct themselves to learn and utilize 
whatever cues the situation provides. A reward, even an irrelevant re- 
ward, is such a cue because a rewarded response is different from the 
majority of responses which remain unrewarded. Thus there is a tend- 
ency to follow rewards and to consider them relevant if not dissuaded 
by instructions to the contrary. Wallach and Henle have clearly shown 
that a change in attitude can resist the ‘‘mechanical” impact of a re- 
ward. Investigators of human conditioning have long recognized the 
importance of subtle attitudinal factors (108). Attitudinal factors are 
probably of equal importance in the study of S-R connections by Thorn- 


dike’s technique. 


The issue of information versus effect remains unsettled. The diffi- 
culty is that it has not been possible to prove conclusively that rewards 
do not yield information even when experimenters hope and believe 
that they do not. An experimenter provides what he considers a non- 
informative reward but the subject may extract information from it, 
especially if he is a subject capable of symbolization. i 


THE SPREAD OF EFFECT 


There is one pehnomenon which more than any other has bolstered 
the view that rewards act mechanically and blindly. This phenomenon 
is the spread of effect. Again the pioneer investigation was carried out by 
Thorndike. In 1933 he published evidence that a reward strengthens not 
only the connection which it directly follows and to which it belongs 
but also the connections which precede and follow the rewarded response 
(270). The closer in the series an item is to the rewarded connection 
the more it benefits from this spread of effect. Thus there is a double 
(before and after) gradient of effect, with the items preceding the rewar d 
showing somewhat less strengthening than those following the reward. 
Degree of spread depends primarily on the number of serial steps sepa- 
rating an item from the reward and not so much on sheer tempora 
proximity. a 

Thorndike considered this finding an independent proof of the law of 
effect (272). The discovery of the spread phenomenon gave Thorndike 
new confidence not only on backward action but also in the mechanical 
action of satisfiers. Again in his own words: É 


The satisfier acts (upon a neighboring punished connection) unconsciously 
and directly, much as sunlight acts upon plants, or an electric current upon ® 
neighboring current, or the earth upon the moon. From a satisfier issues a 
hening force which the connections absorb. Which of them will absor 


strengt cs 
it depends on the laws of nature not of logic and teleology (270, p. 48). 
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As to the particular mechanism responsible for the spread, Thorndike 
entertained two hypotheses: (1) the scatter hypothesis: the strengthening 
effect, being not logical but a biological force will sometimes miss its 
mark, striking preceding and succeeding connections. The gradient thus 
Tepresents the decreasing probability of chance errors in the action of 
the satisfier. (2) The spread hypothesis: the confirmatory reaction caused 
by the reward may be diffuse, spreading out its influence over a range 
of items. 

Thorndike’s findings have been repeatedly confirmed with both 
human and animal subjects (15, 72, 134, 135, 136, 197) and the phe- 
nomenon became known as the ‘Thorndike effect.” Agreement ceases, 
however, when it comes to an interpretation of the phenonenon. Dis- 
Cussion has centered around a number of related questions: (1) Is the 
Sradient of effect really double-winged or is the before gradient spurious? 
(2) Is the gradient of effect a gradient of variability? (3) Is the gradient 
a result of perceptual emphasis on the rewarded response rather than 
of the mechanical action of satisfiers? (4) Is the gradient of effect really 
à gradient of response habits? 


1. Is the gradient of effect bidirectional? For Thorndike one of the 
Most important characteristics of the spread of effect is its bidirection- 
ality; rewards exert their influence both in the forward and backward 
ni aton; The interpretation of the before (backward) gradient runs 

° a methodological difficulty, however. A connection which precedes 
3 reward also follows a reward. In Thorndike’s experiments rewards 

ten followed each other rather closely. In criticizing Thorndike’s data 
“ton therefore raised the question whether the backward gradient 
Might not be a spurious function of a large forward one (286). He also 
Pointed out that serial position needs to be taken into account with this 
"Ds of material. Putting these considerations to the test-—taking serial 
Position into account and attempting to eliminate the influence of the 
davard gradient from the backward gradient—Tilton still found a 
mae gradient but from both success and failure. In the case of the 
ure gradients, the backward one is more pronounced, while the for- 
eee gradient is the more pronounced in the case of success. Other in- 

igators also report the existence of a failure gradient (197, 332). 
that (ouble-winged gradient is thus confirmed, it is true, but we find 
has effect of Wrong spreads as well as the effect of Right. These 

. s render doubtful Thorndike’s theory of the scatter or spread of 
that o ftming reaction. As an alternative explanation Tilton suggests 
5 connections which the experimenter regards as discrete are not 
i a fonally separate. Items in a series may be sufficiently unified to 
ty ected by success and failure as a unit (286). It is not entirely clear, 
ever, why such “sequential unity” should manifest itself as a double 


p 


Ga 


518 LEO POSTMAN 


gradient. Further doubt is cast on this hypothesis by Zirkle’s finding 
that the rewarded item and the adjacent punished ones need not be 
qualitatively similar in order for the Thorndike effect to appear (331). 
Even more strikingly, Zirkle has demonstrated that it is the response 
adjacent to the rewarded item which is strengthened, not the adjacent 
S-R connection. When the relative order of wrong series items about 
a right item is shifted from one presentation to the next, there is no 
Thorndike effect when repetitions of S-R connections are counted. How- 
ever, a clear-cut Thorndike effect appears when repetitions of responses 
are counted by step-position alone without regard to the shifting po- 
sitions of the stimulus items. It has also been found that reward 
strengthens the early wrong responses to near-by stimuli not merely the 
last response (174). 

2. Spread of effect as a gradient of variability. There have been other 
reformulations of the Thorndike effect. Muenzinger and Dove (197) 
disposed of the scatter hypothesis which ascribes the spread of effect 
to inaccurate placement of reward and consequent uncertainty of recall. 
A gradient is present, with a steeper slope than usual, even if the suc- 
cessful response is learned beforehand so that there can be no possible 
confusion between the correct response and the surrounding wrong ones. 
For a gradient of uncertainty Muenzinger and Dove substitute a gradi- 
ent of uniformity or variability. Success produces a gradient of uni- 
formity: this is in essence the original Thorndike effect. Failure pro- 
duces a gradient of variability: not only does the wrong response itself 
tend to be varied but right responses near the wrong one are not re- 
peated with the same degree of uniformity as those farther away in 
the series. Spread of variability under punishment is also reported by 
Stone (260). This analysis still leaves open the question as to what 
intervening mechanism the changes in variability should be ascribed. 

3. Is the effect gradient due to perceptual organization? The system- 
atic differences in approach to the problem of effect in general are neces- 
sarily reflected in interpretations of the spread of effect. Again we find 
those who regard rewards and punishments primarily as perceptual 
events opposing those who think of the mechanical action of satisfiers 
and annoyers. The conditions under which the Thorndike effect gener- 
ally appears are especially favorable to an analysis in perceptual terms. 
The rewarded item is an isolated Right in a long homogeneous series 0 
Wrongs. The announcement of Right may be regarded as a figure against 
a homogeneous ground of Wrongs. Wallach and Henle were the first to 
suggest that it is the extreme crowding of items which interferes wit 
memory for wrong responses and contributes to the subject’s tendency 
to repeat them (305, 306). The relation between perceptual isolation 
and the Thorndike effect was systematically investigated by Zirkle 
(332). He found a clear-cut positive relation between the degree v 
isolation of a rewarded response and the steepness of the effect gradien 
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(especially the after gradient). He also found a failure gradient around 
a perceptually isolated wrong response. On the basis of his experimental 
results Zirkle proposes a theory of isolation to account for the Thorndike 
effect. “A satisfier isolates tendencies which are at hand when it hap- 
Pens ... responses neighboring upon a key response tend to become 
tsolated themselves because of their association with the key response” 
(332, P. 312f.). A theory of isolation is, of course, a perceptual theory of 
the spread of effect. That which stands out in a homogeneous series be- 
comes a focus of retention (similar to the von Restorff phenomenon). 
‘his interpretation is closely akin to Tolman’s law of emphasis. Thorn- 
dike and Zirkle thus clearly represent the opposition between an inter- 
Pretation of the effect gradient in terms of ‘‘hit-and-miss” mechanical 
reinforcement on the one hand and a view stressing the laws of per- 
Septual organization on the other. 
. 4. Is spread of effect a gradient of response habits? In terms of an 
Orthodox connectionist analysis the Thorndike effect indicates that the 
influence of reward spreads to neighboring S-R connections. It is the 
-R connection that is the basic unit of analysis. It is possible, however, 
to account for at least part of the effect gradient by an analysis of the 
response sequence, disregarding as it were the stimulus side of the S-R 
connection, We have already referred to Zirkle’s finding that a Thorn- 
ike effect can be demonstrated when responses are counted by step- 
Position eyen though the stimulus order is changed from presentation to 
Presentation, The problem of response habits was systematically at- 
tacked by Jenkins and Sheffield (136). They found that (1) when the 
rewarded response itself was not repeated few adjacent errors were re- 
Peated and the Thorndike effect failed to appear; (2) when the re- 
Warded response itself was repeated it was accompanied by a high level 
+1 Tepetition of errors and a typical effect gradient. Thus repetition of 
© rewarded response appears to be a necessary condition for the ap- 
pearance of the Thorndike effect. The spread of effect may therefore 
ue not to the automatic action of reward but to the subject’s tend- 
ncy to repeat the same sequence of responses from trial to trial, i.e. the 
subject's “guessing habits.” Responses are not independent of each 
Other: choice of a response depends on the preceding responses. A 
epeated rewarded response ensures that “the errors following reward 
Will be frequently preceded by the same response, i.e., will have a com- 
oo antecedent stimulus.” It is interesting to recall in this connection 
a Preliminary rehearsal of the rewarded response yields an especially 
“ Cep effect gradient (197) and that, on the other hand, a gradient fails 
i aPpear when the subject is not motivated to repeat the rewarded 
og ponse (305, 306). Guessing sequences have also been reported with 
her types of multiple-choice responses (72), and there seems little 
ae that at least part of the Thorndike effect can be analyzed in these 
tms. The demonstration of guessing habits does not necessarily dis- 
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prove the automatic spread of reward as conceived by Thorndike. It 
may well be that both some “stamping-in” mechanism and response 
habits interact in the production of the Thorndike effect. 


As in the case of other conditions of reinforcement, it is necessary to 
stress the dependence of the spread of effect on the parameters of the 
experimental situation. As we have seen, the Thorndike effect fails to 
appear when subjects lack the motivation to learn (305, 306) though 
distraction does not seem to affect it (136). In animal subjects, increase 
in drive results in a higher frequency of repeated responses in general, 
with a consequent flattening of the after-gradient (134). Similar re- 
sults are obtained with an increase in incentive (135). Under such con- 
ditions, only responses preceding the reward yield a statistically reliable 
gradient. Such delicate dependence of the gradient on particular experi- 
mental conditions counsels against hasty analogies between the action 
of satisfiers and the ‘‘action of the earth upon the moon.” 


PARAMETRIC STUDIES 


We have repeatedly stressed the functional dependence of the effects 
of reinforcement on the parameters of the experimental situation. We 
now turn to a consideration of studies that are primarily concerned with 
the influence of such parameters of reinforcement. No exhaustive survey 
will be attempted but the major types of functional relationships will 


be illustrated. 


Amount of Reinforcement 


Thorndike’s original statement of the law of effect included the as- 
sertion that ‘“‘the greater the satisfaction or discomfort the greater the 
strengthening or weakening of the bond.” Demonstration of the corre- 
lation between amount of reinforcement and degree of learning has 
proved much easier with animal subjects than with humans. The moti- 
vation of animal subjects is more easily controlled and hence quantita- 
tive variations in reward are more effective. Increasing the food ration 
of a hungry rat affects behavior more drastically than adding 0.4 cent 
to the announcement of Right in a Thorndike experiment. 

Turning to animal studies first, there are a number of experiments 
with different species and different learning tasks which support the 
generalization that increases in reward lead to improvements ın learning 
and performance. Chickens (86) as well as rats (42) run downa runway 
at a faster speed when the amount of food reward is increased. Incre 
ments in reward also cause chicks to learn a maze with fewer errors 
(320), rats to show greater resistance to the extinction of a conditione 
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response (77), and chimpanzees to tolerate longer delays between the 
Presentation of the stimulus and the response (208). Although the 
functional relationship between amount of incentive and performance 
varies from situation to situation, it is clear that the relationship is not 
linear, Equal increments in incentive do not lead to equal increments in 
Performance. Thus Crespi, who was especially concerned with the 
quantitative relation between amount of incentive and performance re- 
Ports a sigmoid relationship over a wide range of variation (42). Hull, 
On the other hand, believes that the relationship is best represented by 
a simple positive growth function (127). 
_ The mechanism by which increments in reward exert their influence 
's still under discussion. Hull has suggested that an increase in the 
amount of reinforcement raises the limit to which the curve of habit 
Strength approaches as an asymptote, although the rate of approach 
may be constant for all amounts of reinforcement. Hence increases in 
the amount of the reinforcing agent result in greater increments of habit 
Strength per reinforcement. Hull also points out that in a conditioning 
Situation the reward provides an important component of the stimulus 
Situation which is conditioned to the response being reinforced. In such 
a situation a large reward stimulus evokes stronger, more vigorous, and 
More persistent responses than a small reward stimulus. Thus an in- 
rement in reward not only increases the amount of consummatory ac- 
tivity (need reduction) but also provides a more distinctive cue to the 
animal being conditioned (127). In this connection it is important to 
‘stinguish between sheer physical amount of reward and the amount of 
activity involved in the consumption of the reward. Wolfe and Kaplon 
Were able to show that amount of reward and amount of consummatory 
activity are experimentally separable conditions of learning. Learning 
‘Mproves asa function of sheer amount of reward: a whole kernel of corn 
a more effective incentive for chickens learning a maze than one- 
Ourth of a kernel. However, learning also improves if the amount of 
“onsummatory activity is increased for a constant reward: when the 
Whole kernel of corn is divided into four separate quarters the maze is 
tarned better than with the equivalent amount given in one piece. Of 
the two factors, amount of consummatory activity influences learning 
4 greater extent than sheer amount of reward (320). . 
€ role played by amount of reinforcement has also been explained 
ms of the subject’s attitude rather than in terms of differential 
8thening of S-R connections. Crespi has proposed a two-factor 
"y of incentive-value. He conceives of incentive-value as propor- 
al to the distance between the subject’s level of expectation (both of 
“antity and quality of reward) and the level of attainment. Attain- 
fi “nt which does not reach the level of expectation is frustrating and af- 
&cts learning adversely, while attainment above expectation is ‘“‘elating 
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and improves learning. It is not the sheer amount of reward alone which 
is decisive nor the animal’s expectancy alone but rather the relation 
between the two. A striking illustration of this principle is the fact that 
rats will perform significantly better with no incentive at all than with 
a very small incentive. A very small incentive serves to whet the ani- 
mal’s appetite, raises his level of expectation and eventually leads to 
frustration. In accordance with the same principle, downward shifts in 
the amount of incentive lead to poorer and more variable performance 
whereas upward shifts improve the performance (42, 42a). Crespi’s con- 
ceptualization is reminiscent of the experiments on level of aspiration of 
human subjects. That the effectiveness of a given quantity of rewar 

cannot be evaluated without reference to the subject's expectancy i 

also stressed by Cowles and Nissen (40) on the basis of experiments 
on delayed responses in chimpanzees. Similarly Nissen and Elder 
(208) report that increases and decreases in amount of incentive affect 
not only the response within a given trial but succeeding trials as well. 
Such perseverative effects suggest the operation of reward expect- 
ancy. 

When human subjects are used, comparisons of different amo 
of reinforcement yield variable and inconclusive results. Slight incre 
in rate of learning as a function of increases in reward—addition © 

small money gains to the announcement of Right—have been found in 
some Thorndikian experiments (60, 223, 280). The effects are exceed- 
ingly slight if the amount of reward varies within the same series (223) 
and are somewhat more pronounced if incentives are changed from 
series to series (280). On the other hand, promise of a reward was found 
as effective as actual administration of the reward (78). In situations 
of this type, the experimenter never knows whether an increase in re- 
ward is experienced as such by the subject. Moreover, if different re- 
wards are presented in the course of an experiment, their effects may 
interact with each other. Whatever effects there are, may possibly not 
stamp in individual responses more firmly but rather affect the general 


level of the subject's motivation and thus affect learning secondarily 
(171). 


unts 
ases 


Frequency and Pattern of Reinforcement 


The simplest hypothesis relating frequency of reinforcement with 
strength of learning is that each reinforcement adds to the strength of 
the S-R connection being reinforced. This hypothesis is plausible, at 
least at first blush, in the light of empirical data. The percentage of 
conditioned responses usually increases as a function of the number © 
reinforcements (108), human rote learning steadily improves on succes” 
sive trials (171). Indeed, it is difficult to think of learning situations in 
which an increase in number of trials does not result in better perform- 
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ance.“ Frequency per se cannot be profitably considered a significant 
condition of learning. Rather, repeated pairings of stimulus and re- 
sponse allow effective conditions of learning, (such as simultaneous con- 
ditioning, reinforcement by need reduction, confirmation of expecta- 
tions, etc.) to exert their effects. There would probably be little dis- 
agreement with such a general formulation of the role of frequency. 
Agreement ceases when the assertion is made that each individual rein- 
forcement in a series of reinforcements contributes a differential incre- 
ment (say, AgHx in Hull's language) to the existing habit strength. The 
theoretical question at issue is whether or not the effects of successive 
reinforcements are additive and cumulative. Is the law of effect a law 
of cumulative effect? 
, The assumption that the effects of successive reinforcements con- 
tinuously cumulate in time is central to Hull's theoretical system (127). 
© conceives of habit strength as a monotonic increasing function of 
the number of reinforcements. As habit strength approaches the sub- 
Ject’s physiological limit, the increment from each reinforcement pro- 
gressively decreases in magnitude. Thus habit strength is a simple 
Positive growth function of the number of reinforcements. Habit 
Strength, as Hull uses the term, is a theoretical construct which can be 
measured only indirectly through its behavioral manifestations. There 
are a number of behavioral studies whose results are consistent with 
ull’s quantitative picture of the growth of habit strength: 

1. Reaction amplitude increases as a function of the number of reinforce- 
ments (118), 

2. Reaction latency decreases with the number of reinforcements (235). 

3. The number of trials required for experimental extinction may be pro- 
Portional to the number of reinforcements (209, 317, 329). 

4. The more frequently a response has been reinforced the greater the 
Probability that the appropriate stimulation will evoke that response (162, - 
269, 276, 320). 


Unfortunately not all experimental results available fit into Hull’s 
theoretical picture. Leeper (160) has criticized Hull for what he con- 
Siders a biased selection of illustrative experiments. There is substantial 
©xperimental evidence that partial reinforcement, i.e., reinforcement on 
Only a fraction of the trials rather than on each trial may be at least as 
“flective as continuous reinforcement. Thus Humphreys showed that 
reinforcement on 50 percent of the trials is as effective in the establish- 
Ment of a conditioned response as reinforcement on 100 percent of the 


= A notable exception is Skinner’s finding that a single reinforcement results ina 
Ries of lever pressings by the rat at an optimal rate (237). One reinforcement is suf- 
Slent to establish an adequate response strength. Further reinforcements serve to build 
a reflex reserve, which is subsequently emptied in the absence of reinforcement 


(extinction), 
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trials and that partial reinforcement may result in greater resistance to ex- 
tinction than continuous reinforcement (128, 129). Humphreys believes 
that conditioned responses occur to the extent that the subject expects 
the reinforcing stimulus to follow the conditioned stimulus, extinction 
occurs to the extent that the subject no longer expects reinforcement. 
Partial reinforcement during the initial training period makes it difficult 
for the subject to shift from expectation of reinforcement to expectation 
of non-reinforcement. In terms of the expectancy hypothesis it is neces- 
sary clearly to distinguish between frequency of trials and frequency of 
reinforcements as conditions of learning. The relation between these 
two frequencies determines the subject’s expectations and hence the 
course of acquisition and extinction of responses (130). The equal ef- 
fectiveness of partial and continuous reinforcement has, however, also 
been ascribed to the influence of secondary reinforcement during the 
ostensibly unreinforced trials (52). Such an explanation would be in 
conformance with a description of the reinforcement process as con” 
tinuous and cumulative. The relationship between number of trials an 

relative frequency of reinforcement is complicated by the phenomenon 
which Hovland has called “inhibition of reinforcement” (117). A mast- 
ing of reinforcements results in a weakening of conditioned responses, 
with spontaneous recovery within a short interval of time. Inhibition > 
reinforcement as well as expectancy have to be taken into account in 
the analysis of partial reinforcement (75, 76). 

The studies cited thus far by no means exhaust the evidence for the 
equal effectiveness of partial and continuous reinforcement. Brogden 
(16) showed that conditioned flexion to shock was established as readily 
with 40 percent reinforcement as with 100 percent. Decrease in the 
frequency of reinforcement had the positive effect of eliminating 4 great 
deal of the animal’s diffuse and restless behavior. When food rather 
than shock was used as the reinforcing stimulus substantially the same 
results were obtained, except that there was a slight decrease in the 
frequency of response (CRs on ¢ of the trials) when reinforcement W 
applied only 20 percent of the time. Brogden concludes that applicatio” 
of the reinforcing stimulus serves primarily as an incentive to the subject: 
and thus a low frequency of reinforcement may maintain the condition 
response at a high level. 

Mowrer and Jones (187), who confirmed Humphreys’ findings, pre” 
sent an argument reconciling these results with the law of effect. The 
effects of a reward need not necessarily be restricted to the particular 
response that occurs just before the reinforcement. The reinforcem® y 
applies to preceding responses as well (though to a decreasing extent): 
If we think in terms of response units (sequences) each of which is fol- 
lowed by a reward rather than in terms of individual responses whic? 


as 


sometimes are reinforced and sometimes are not, the apparent adv. 
tage of intermittent reinforcement disappears. On the contrary: 


ee 
2 e a —_ 


PRESENT STATUS OF THE LAW OF EFFECT 525 


terms of a response-unit analysis, the intermittently reinforced group 
which has to expend more effort in order to obtain a reward gives fewer 
extinction responses.!® The response-unit hypothesis of Mowrer and 
Jones points to the importance of considering the temporal pattern of 
reinforcement and not only the sheer frequency. Skinner's results with 
Periodic reconditioning are a relevant case in point (237). When the 
lever-pressing response of the rat is reinforced periodically, e.g., every 
three minutes, the animal's rate of response not only tends to become 
uniform, but also the more frequent the reinforcement the more rapid is 
the rate. On the other hand, when reinforcement is at a fixed ratio, i.e., 
when the final member of a fixed number of responses is reinforced, the 
less frequent the reinforcements the higher is the rate of response. Such 
laws of operant behavior can be analyzed only in terms of the total 
temporal pattern of responses and reinforcements (e.g., Skinner’s con- 
cept of reflex reserve). The importance of the temporal pattern is also 
illustrated by Brunswik’s finding that in discrimination learning proba- 
ility of success is an important determinant of the animal's behavior at 
a choice point (21). 

In summary, it is clear that frequency of reinforcement is an impor- 
tant determinant of the strength of learning. However, partial rein- 
forcement can be as effective as, and more effective than, continuous 
reinforcement, Reinforcements are not always simple additive units, 
and the temporal pattern of a series of responses and reinforcements gives 
rise to behaviors which cannot be predicted in terms of a simple mono- 
Jorie relationship between frequency of reinforcement and strength of 

arning, 


Delay of Reward and Gradient of Reinforcement 


The degree to which reinforcement strengthens a response depends 
in part on the time interval that elapses between response and rein- 
Orcement., Experimental inquiry has been directed at two interde- 
Pendent problems: (1) the difference between immediate reinforcement 
and delayed reinforcement and (2) the exact quantitative relationship 

etween length of delay and strength of learning, i.e., the nature of the 
Sradient of reinforcement. A ] 
horndike early suggested that satisfying states of affairs are the 
More effective the closer they are temporally to the S-R bond. ‘Other 
fe gs being equal, the same degrees of satisfyingness will act more 
Strongly on a bond made two seconds previously than on one made two 
Minutes previously ./. . ” (265, p. 172). A few of the early investigations 
of delayed reward failed to bear out this prediction (309, 313), but it is 
Now clear that in these experiments secondary reward was not con- 


hat the law of effect is compatible with 
ment of an expectation may serve as a 
\ 


Me Mowrer and Jones (187) also point out t 
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trolled—the animal subjects were delayed in the food chamber of a maze 
—-so that there was no effective delay of reinforcement. When these 
first attempts are discounted, the experimental literature shows general 
agreement on the superiority of immediate over delayed reward. The 
detrimental effects of delayed reinforcement have been demonstrated 
with animal subjects in maze learning (34, 100, 318), problem box learn- 
ing (222), the formation of discrimination habits (35, 321), and in the 
establishment and maintenance of operant responses (210, 211, 237). 
Similar findings have been reported for delayed punishment (32, 141, 
192, 308, 311, 321, 323). 

Different investigators used different periods of delay but there were 
strong indications that the most serious detrimental effects were con- 
centrated in the first minute of delay. Formal quantification of the 
functional relationship between length of delay and strength of learning 
are found in the writings of Hull (127) and Perin (210, 211). On the 
basis of theoretical calculations, which fit a considerable amount of 
empirical data, Hull concludes that (1) habit strength is a negative 
growth function of the time separating the response from the reinforce- 
ment and (2) the asymptote of this gradient is zero, i.e., with a suti- 
ciently long delay reinforcement becomes ineffective. In the case of the 
rat, the gradient reaches zero at a delay of about 30 seconds (210). This 
rather short gradient of habit strength constitutes the gradient of rein- 
forcement." It is important to bear in mind that the gradient of rein- 
forcement refers to the effects of different intervals between single re- 
sponses and that it does not apply to one of a series of responses leading 


ommon reward (108). In the case of a series of responses, e.g+1 11 @ 


toac 
nerated 


maze, the gradient is more extended and more complex, and is ge 
by the “summation of an exceedingly complex series of overlapping 
gradients of reinforcements, in part consisting of, but largely derive 

from, the ‘primary’ reinforcements occurring at the end of the tempora 
period covering the behavior sequence involved” (127, p. 143). In other 
words, more and more members of a stimulus series acquire (secondary) 
reinforcing power, each probably in accordance with the short gradient 
of reinforcement. The summation and interaction of these short gradi- 
ents result in the extended goal gradient. Hull believes that the g02 
s an exponential or negative growth function (not, as he ha 

believed, a logarithmic function). The greater the influence 
reinforcement, the less steep the slope of the goal gradient, 
the temporal range over which a reinforcement can exer 


gradient i 
originally 
of secondary 
i.e., the greater 
its influence. 


16 Skinner (237) fou 
whereas delays up tO 


pressing response. 
11 The expression 


nd that delays are detrimental only after periodic reconditioning 
4 seconds did not affect the original conditioning of the lever 


gradient of reinforcement was first used by Miller and Miles (178). 
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Hull’s goal gradient hypothesis (120, 122) has proved to be a power- 
ful deductive tool by means of which a considerable amount of empirical 
results could be predicted, sometimes with striking accuracy. Among 
the findings which the hypothesis predicts and which were empirically 
confirmed are the following: 


1. With number of reinforcements held constant, when the delay of rein- 
°rcement is short, less time is required to execute a response than under condi- 
Hons of long delay of reinforcement (6). 

2. Of a pair of alternative acts the one which is reififorced with a shorter 
delay is chosen (5, 226). 

3. Other things being equal, the greater the difference in the delays of re- 
inforcement yielded by two alternative reactions, the more quickly the animal 
Will learn to choose the act yielding the shorter delay (5). 

4. When the absolute differences in delay are equal, differentiation between 
two short delays is achieved faster than differentiation between two long de- 
ays. For example, a 30-second delay is more readily differentiated from a 60- a 
po delay than a 60-second delay is from a 90-second delay (5, 85). On the 
asis of the goal-gradient hypothesis Hull has also been able to predict the 
effects of delays of equal relative, but of different absolute, duration (327). 


. The gradient of reinforcement in Hull's treatment is, strictly speak- 
Ng, a temporal gradient. As Hilgard and Marquis (108) point out, such 
temporal gradients should be clearly distinguished from non-temporal 
8radients which refer to the spatial distance between a response and the 
reinforcement or the serial position of a response with respect to the 
sii orcement. Spatial separation or remoteness in a series, of course, 
ae temporal delay since it takes time to traverse the space leading 
mo goal or the members of a series ending in reinforcement. On the 
ee hand, an animal delayed in a restraining compartment is in a 
i ry different situation from a subject that has to cross a runway in 
rder to reach a food reward. The main difference lies in the fact that 
€ Spatial interval is filled with a series of acts leading to the reinforce- 
ent whereas sheer temporal delay may serve to disrupt the integration 
a behavior sequence. The rat’s speed-of-locomotion gradient (122) in 
Straight alley is a spatial gradient, with running speed plotted against 
Uccessive segments of the path leading to the reward (6, 122) or punish- 
ent (23, 176). Similarly, the extensive experimental investigations of 
mon 80al gradient in maze learning (120, 238, and many others) and of 
3 Imals’ ability to discriminate short from long paths to reward (35, 50, 
aan 327) should be considered primarily as studies of the xon-temporal 
tee of the gradient of reinforcement. In such experiments, the 
a cial experimental variable is the distance between a segment of the 
PParatus (runway, maze) and the locus of reinforcement. This dis- 
„ance involves not only a temporal delay but also a complex sequence of 
tvening acts. j 
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A notable example of a spatial gradient is, of course, the Thorndike 
effect. The close kinship between the Thorndike effect and the goal- 
gradient was demonstrated by Muenzinger, Dove and Bernstone (198). 
In an “endless”? maze (four identical mazes arranged as the sides of a 
square, with food boxes at the four corners) a double-winged gradient of 
elimination of errors was obtained. These authors believe that the goal- 
gradient is bidirectional in its fundamental form and that the usual 
backward elimination of errors in a maze reflects only the first half of 
the gradient The other half cannot manifest itself because the animal’s 
activity usually ends at the goal. This analysis has not remained undis- 
puted. Hill (109) reports that the forward gradient appears only late in 
learning and ascribes it to the failure of anticipatory errors to be elimi- 
nated. In a more recent investigation, however, Thompson and Dove 
again report evidence for a basically bidirectional goal-gradient (262). 
With human subjects, gradients of reinforcement have usually been 
plotted as a function of serial position rather than in terms of sheer 
temporal delay. The various studies of the Thorndike effect are a case 
in point: the spread of effect is most clearly demonstrated when fre- 
quency of repetition is plotted against distance from reward in terms of 
response units (15, 270, 331). On the other hand, when time alone is 
considered, a 6-second delay is found to be as beneficial to learning as 4 
0-second delay of reinforcement, and there is no evidence for a temporal 
gradient (167). The activity filling the interval between response an 
reinforcement is an important factor: when an announcement of Right 
follows an interval filled by another response, i.e., if the reward refers 
to the next to last response, the reinforcement is virtually ineffective. 
e and reinforcement must “belong” together. (Such detri- 
mental effects of interpolated activities are, of course, well known in the 
study of retroactive inhibition.) The fact that the gradient of reinforce- 
ment, which is readily demonstrated with animals, cannot be easily ap- 
plied to human subjects is of considerable theoretical importance. With 
the aid of symbols the human learner can bridge temporal gaps that are 
prohibitive to the animal (192). Such apparent independence of im- 
mediate reward has considerably complicated the application of the law 


of effect to human learning. 


Strength of Drive 
The modern version of the law of effect equates effect to drive re- 
duction. The strength of the drive could be expected to affect the opera- 
tion of the law in two ways: (1) strength of the drive could be one of the 
determinants of the speed of acquisition and (2) performance (habit evo- 
cation) may vary with the strength of drive. 
What is the influence of the strength of drive at the time of learning? 
( l investigation by Finan (73) showed that with the num- 
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ber of reinforcements constant an instrumental act is established more ` 
strongly when the drive is strong than when it is weak. A stronger drive 
yields a stronger habit although the relationship is by no means linear. 
This generalization has, however, not remained unchallenged. In Hull’s 
theoretical system, the course of acquisition (building up of habit 
strength) is described as independent of the strength of drive at the time 
of learning. Habit strength is a joint function of number of reinforce- 
ments, the time for which the conditioned stimulus has been acting be- 
fore the occurrence of the response to be learned, the time interval be- - 
tween response and reinforcement, and the magnitude of the goal object 
(127, 144). Drive strength is not one of the variables of which habit 
Strength is a function. In a recent study Kendler (145) has justified this 
Omission of drive strength as one of the determinants of habit strength. 
n one of his experiments, animals learned a bar-pressing response under 
different degrees of thirst deprivation. The animals were, however, 
equated in strength of motivation during extinction. The results show 
that different drive conditions at the time of learning had not influenced 
the amount of habit growth. Kendler also showed that a group of sub- 
Jects which had a low drive strength during learning but received a 
arge number of reinforcements established a stronger habit than a 
Matched group which learned under high motivation but received a 
Smaller number of reinforcements. The results reported by Finan and 
Kendler are contradictory and the role of the strength of drive during 
acquisition must remain open. 

, There is general agreement, on the other hand, that the strength of 
drive is an important determinant of performance. According to Hull 
Teaction-evocation potentiality is a multiplicative function of habit 
Strength and drive strength. This hypothesis receives experimental sup- 
Port from Perin’s finding that for a given number of reinforcements, re- 
Sistance to experimental extinction is an almost linear function of the 
number of hours of food deprivation at the time of the extinction pro- 
cedure (209). In his investigation of operant conditioning, Skinner (237) 
Ound that rate of response is considerably affected by the animal's 

‘Ive: over a wide temporal range of food deprivation, increase in drive 
oa faster rate of emission of responses (emptying of the reflex re- 

rve), 
_ “Although performance varies with strength of drive, there are some 
Mdications that appropriate stimuli will evoke a response even when the 
Subject is to all intents and purposes fully satiated (209, 330). Extra- 
Polation from Perin’s theoretical curves relating resistance to extinction 
© drive strength predict this result. But here, too, the last word has 
ot been spoken. Koch and Daniel (147) report zero or close to zero 
reaction potential immediately after satiation. Although the general 
“pendence of performance on strength of drive may be consicered as 
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well established, the precise quantitative nature of this relationship as 
ell as the problem of interaction of different drives (144, 145) is still in 


w 
need of further experimental analysis. 


THE PLACE OF THE LAW oF EFFECT IN LEARNING THEORY 


Parametric studies such as those described in the preceding section 
derive their main significance from the contribution which they can 
make to general learning theory. We shall now consider the role played 
by the law of effect in theoretical interpretations of learning. 

Theories of learning can be classified in more than one way. One 
can distinguish between molar and molecular theories, according to 
emphasis on specific movements as against stress on acts and their 
outcomes; or one can pit modern associationism against configurational 
theories. The role assigned to reinforcement by reward and punishment 
provides another criterion of classification. There are clear-cut lines of 
division separating systematic points of view according to the role 
assigned to the law of effect: (1) effect may be considered to be a princi- 
pal condition of all learning, (2) effect may be rejected as a condition 
of learning but considered an important determinant of performance, 
(3) the essentials of the learning process may be conceptualized without 
reference to effect, with reward and punishment assigned a subsidiary 
role and credited with only indirect influence on learning. 


The Law of Effect as a Principal Condition of all Learning 


The law of effect could not become the pivot of a comprehensive 
theory as long as it was restricted to the narrow universe of multiple- 
choice (“trial-and-error”) learning. It was only with its application to 
the facts of conditioning that the law of effect could become a unifying 
principle around which a systematic theoretical structure could be built. 

For a considerable period of time students of effect and students of 
conditioning failed to make contact with each other. In the field of 
classical conditioning stimulus substitution was the primary principle of 
explanation, To the extent that substitution could not account for all 
the empirical data, auxiliary hypotheses such as the principle of domi- 
nance and such concepts as set and attitude were introduced (105, 106). 
On the other hand, Thorndike firmly maintained the distinction between 
learning by selection and fixation (formation of S-R bonds) and con- 
ditioning, which he termed “associative shifting.” Thorndike never 
believed that the study of conditioning could throw much light on the 
nature of learning: ‘‘The conditioned reflex is one type of learning that 
manages, even more completely than maze learning, to conceal the true 
nature of the learning process in a mass of special conditions” (267, P- 
85). In the light of this opinion it is not surprising to find that Thorn- 
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dike has almost completely ignored the facts and theories of condition- 
ing in all his writings. Students of conditioning, on the other hand, 
could not indefinitely ignore the problem of effect, for the crucial role of 
incentive in the establishment and maintenance of conditioned re- 
sponses was an incontrovertible experimental datum. 

Paralleling Thorndike’s dichotomy between connection formation 
and associative shifting, several two-fold classifications of learning situa- 
tions have been made: classical and instrumental conditioning (108); 
Type S and Type R conditioning (236) ; conditioning and success learn- 
Ing (228); quantitative and qualitative conditioning (216) ; conditioning 
and motivated learning (74). Such classifications reflect important dif- 
ferences in experimental procedures under which learning can take 
Place, but, as Hilgard and Marquis (108) have emphasized, they do not 
represent pure types of learning which necessarily require principles of 
explanation as different as substitution and effect, In any given experi- 
ment, both types of learning may take place, and the classification of an 
experiment will largely depend on which aspects of the response the 
experimenter emphasizes and measures. A classical conditioning experi- 
ment emphasizes stimulus substitution and homogeneous reinforce- 
ment; an instrumental reward or escape experiment dramatizes the 
Principles of effect and heterogeneous reinforcement. “It isa common 
error to permit the reference experiment to dramatize a particular proc- 
ess, and then to suppose that the experiment represents a pure case of 
the process dramatized” (108, p. 97). 

Thus, the distinction between learning by selection and fixation 

heterogeneous reinforcement) on the one hand and learning by stimulus 
Substitution (homogeneous reinforcement) on the other does not pre- 
clude a unified conceptual scheme which makes a law of effect the basic 
Principle of all learning. It is the virtue of Hull’s theoretical analysis 
that these two types of learning are subsumed under a common princi- 
ple: they are both reduced to the operation of the law of primary rein- 
forcement (Hull’s formulation of the law of effect). The basic principles 
and generalizations of Thorndike and Pavlov are combined and unified 
1n a single theoretical structure (246). Hull’s law of primary reinforce- 
ment makes temporal proximity to a reinforcing state of affairs a con- 
ition without which learning cannot take place. Both selective learning 
the Thorndike situation) and conditioned-response learning are special 
Cases of the operation of this law. In the case of simple selective learning, 
One of many possible alternative reactions occurs in temporal proximity 
to need reduction and hence is differentially reinforced. Such a receptor- 
€ctor connection may or may not be of super-threshold strength at 

© beginning of an experiment. The conditioned response also depends 
On temporal proximity to reinforcement but in this case a mewreceptor- 
ector connection is almost invariably established: the connection be- 
tween CS and UR. Hull concludes that “the differences between the 
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two forms of learning are superficial in nature, i.e., that they do not in- 
volve the action of fundamentally different principles but only the 
differences in the conditions under which the principle operates” (127, 
p. 78). When the CR is considered basically akin to selective learning, 
the behavioral laws discovered in conditioning experiments can be ap- 
plied to, and integrated with, the phenomena of selective learning. The 
principles of conditioning in conjunction with the law of primary rein- 
forcement can be used in the deduction of more complex forms of 
learning. j 

Although the law of primary reinforcement is closely akin to Thorn- 

dike’s law of effect, there is an important difference which Hull has 
made explicit. As the law has been repeatedly stated by Thorndike 
it is both a law of motivation and a law of learning. In a review O 
Thorndike’s work Hull (123) raised the question whether motivation 
(striving) produces the learning, or learning produces the motivation, 
or whether some third and still more basic process produces both. 
Thorndike’s formulation seems to imply that striving is to be considere 

primary: he defines a satisfier as that which an animal strives to attain 
or does nothing to avoid. Hull, on the other hand, considers learning 
(“strengthening”) primary and derives striving from the principles © 
conditioning and the primary law of reinforcement (need reduction) a$ 
basic assumptions. In his own words, ‘‘states of affairs which organisms 
will strive to attain are reinforcing agents, not because they will evoke 
striving, but they evoke striving now because at some time in the past 
they were potent reinforcing agents, thereby joining stimuli and re- 
sponses . . . which constitute the striving” (123, p. 822). The point 1$ 
that Hull is willing to assume as originally given only those reinforcing 
agents which satisfy basic biological needs and are linked with the or- 
ganism’s survival, and he then derives other motives or strivings wit? 
the aid of the principles of conditioning. In Thorndike’s writings, 
the other hand, we find no such hierarchy of motives. The role 0 

motives (wants, interests and attitudes in learning) is described as two” 
fold: (1) they determine what response a situation shall evoke, and 2) 
the satisfaction of wants strengthens S-R bonds (276). It is to the 
question of the origin of these wants, interests and attitudes that Hull 
addresses himself, seeking to derive them with the aid of conditioning 
principles. : 

The gap between learning situations in which biological need reduc 
tion occurs and the myriad of learning situations in which there is ?¢ 
such immediate primary reinforcement is bridged by the principle ° 
secondary reinforcement. According to this principle, any receptor an 
tivity which regularly precedes a primary reinforcement will itse 
gradually become a reinforcing agent. The sight and smell of food 4 
well‘as other stimuli emanating from a feeding compartment, suc 
the click of the mechanism releasing pellets into a Skinner box (22, 237): 


3 — | 


PRESENT STATUS OF THE LAW OF EFFECT 533 


are sources of secondary reinforcement. A stimulus such as a buzzer or 
tone which has been regularly associated with shock also acquires rein- 
forcing properties (67, 69, 74). The reinforcing power of token rewards, 
i.e., rewards which may subsequently be exchanged for a primary re- 
ward such as food may similarly be explained in terms of derived rein- 
forcement (39, 65, 179, 207, 319). The effectiveness of social rewards 
and punishment calls for similar explanatory concepts: a practically 
ee chain of higher-order conditioning must be assumed (127, 

Recently Spence (247) has suggested that it is the particular stimu- 
lus pattern at the time of reward which acquires secondary reinforcing 
Properties. This reinforcing power is generalized to preceding stimulus 
Patterns according to a temporal gradient. In this conceptualization the 
vexatious problem of backward action is eliminated. 

It is in Hull's theoretical system that the law of effect has come to its 
full theoretical fruition: it has become a primary postulate with whose 
Correctness a complex structure of deductions and theoretical interpre- 
tations must stand or fall. 


The Law of Effect as a Law of Performance 


It is possible to accept the law of effect in a descriptive sense, i.e., to 
Tecognize that rewarded responses are usually repeated in preference to 
Non-rewarded ones, and yet to deny that the law of effect is a law of 

earning. Several writers, under the leadership of Tolman, have taken 
this position (64, 159, 287, 289, 292, 294, 315). A rigorous distinction is 
made between the acquisition and utilization of habits. The acquisition 
of habits depends on the formation of cognitive patterns within the 
Organism which reflect the stimulus relationships in the environment. 
uch cognitive patterns Tolman has described as ‘‘sign-gestalt expecta- 
tions” or “hypotheses” (287, 288, 290, 291). Organisms come to accept 
One event as a sign or “local representation” of another event (293). 
arning occurs when the subject has built up an expectation that a 
Siven sign in the environment will, via a behavior route, lead toa certain 
Significate or outcome. These expectations result from the organism’s 
Commerce with the environment and their acquisition is governed by 
Such conditions as frequency, recency, and perceptual laws of stimulus 
Organization (sign-gestalt formation).'* Differential reward is not con- 
Sidered a determinant of learning. Experienced reward does, however, 
Play a role as a determinant of performance or utilization of habits. 
From a set of alternative responses to a stimulus (sign) that response is 
Selected and performed whose consequence is most “demanded,” i.e., 
Most rewarding in terms of the momentary motivational state of the 


bili, |” building up expectations the organism essentially reacts to the ben proba- 
tian that signs in the environment will be followed by certain outcomes. ya! 
at theory stressing reaction to probability has been elaborated by Brunswik (21). 
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animal. Even though knowledge and need (learning and performance) 
are thus distinguished, behavior is always a joint function of both. 
When a subject moves toward a goal he must (1) have a need for that 
particular goal, and (2) have the knowledge that a given piece of be- 
havior on his part will lead to that goal. Hence needs and knowledge 
constitute an interdependent pattern or field (315). In this theoretical 
account, then, the law of effect is rejected as a law of learning and rele- 
gated to a secondary role as a condition of the moment-to-moment 
utilization of habits that have been acquired independently of effect. 
Only if motivation remains constant can behavior be accurately pre- 
dicted on the basis of the law of effect. 

Tolman calls his account of learning and performance a field 
theory!? which may be applied to substitute stimulus (CR) learning, 
trial-and-error as well as more complex forms of learning such as 
“inferential” and “inventive learning” (290). The learning theory put 
forward by the chief exponent of field theory, Kurt Lewin, (161) is 
closely akin to that of Tolman. Lewin sharply distinguishes between 
learning as change in knowledge or cognitive structure (differentiation 
of unstructured areas, restructurization) and learning as change in 
motivation (changes in valences and values). Changes in cognitive struc- 
ture are ascribed in part to the same type of “forces” as govern percep- 
tual fields, in part to the impact of the needs, values, and hopes of the 
subject. Among the complex of forces governing changes in movitation 
reward is only a minor factor, for “forces governing this type of learning 
are related to the total area of factors which determine motivation and 
development ” (161, p. 239). Throughout his treatment 
hasizes the need to distinguish the motivational from the 
cognitive problems, to study their separate laws, and then to determine 
the role of each type of factor in different learning situations. In this 
connection it should be noted that the distinction between learning 
and performance is not limited to the field-theoretical approaches. In 
Hull’s account, a parallel distinction is, made between the principles 
governing habit-formation and the principles governing habit use. 
Whereas the concept of habit strength describes the degree of acquisition, 
the construct of effective reaction potential refers to the degree to which 
a habit is ready for performance.” Thus both S-R theory and field 
theory allow for the distinction between learning and performance. ‘The 
difference is that for Hull motivational factors govern both acquisition 
and performance, whereas in the Tolman-Lewin formulation motiva- 


i i in the utilization of habits. 

tional factors come into play only in —_ 
One of the main experimental supports for the Tolman-Lewin view 

theory in field-theoretical terms, see Adams 


personality 
Lewin emp 


19 For another restatement of learning 
(1). 
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product of effective habit strength and drive, 


20 Effective reaction potential is a t 
ee inhibitory potential (127). 
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has been the phenomenon of latent learning. When a reward is intro- 
duced after a series of unrewarded trials in a maze, an improvement in 
performance occurs which far exceeds the usual effects of a single re- 
ward. A substantial part of this sudden improvement may then be 
ascribed to learning (formation of sign-gestalt expectations) which took 
place during the unrewarded trials but was not utilized in the absence 
of reward. Such analysis is strictly in accord with Tolman’s theory. 
The phenomenon of latent learning was first demonstrated by Blodgett 
in 1929 (14) and has been repeatedly confirmed since (49, 51, 102, 103, 
295, 304). A recent repetition of Blodgett’s experiment by Reynolds 
(220), however, failed to show latent learning. This result has, at least 
temporarily, detracted somewhat from the support which the latent 
learning experiment has provided for the field-theoretical view. To 
the extent, however, that latent learning has been successfully demon- 
strated it provides a serious challenge to any view that would make 
learning a cumulative function of successive reinforcements by reward. 
As Leeper (160) has pointed out, it is impossible to argue that some other 
reward, such as satisfaction of the exploratory drive, accounts for the 
learning. Such a reward should reinforce exploratory behavior and not 
running to the goal with few errors once a food reward is introduced. In 
this connection it is interesting to note that in a recent attempt to de- 
rive the facts of latent learning within the framework of Hull’s the- 
oretical system Seward (232) was forced to abandon Hull’s postulate 
(No. 4) that increments from successive reinforcement summate to 
yield a combined habit strength. Instead, he had to assume that con- 
ditioning is independent of reinforcement and is complete in one trial 
when S and R are simultaneous. On the other hand, Buxton (28) was 
able formally to derive the phenomenon of latent learning in terms of 
field-theoretical principles. , . 
Utilization of a habit depends on the presence of appropriate motiva- 
tion (appetite for the goal object). Performance of a given act does not 
always depend on the presence of any one particular motive: different 


Motives may be equivalent to each other in their capacity to evoke 


Performance to the extent that the outcome remains congruent pos 
ithin 


Motive that prevails at the time of performance (61, 62, 63, 64). 
imits, therefore, changes in motivation may leave behavioral responses 
(habit utilization) relatively unaffected, or lead to only partial changes 
in performance. Even total removal of reward may lead to an only 
temporary disturbance in performance rather than a permanent disin- 
tegration of a habit (310). When the subject is confronted with a choice, 

Owever, motivation may serve as the basis on which selection among 
alternative responses is made. On this hypothesis, if a subject were to 
learn that one route in a maze leads to food and another route leads to 
Water, he would be expected to choose the food route when hungry and 
the water route when thirsty. This deduction has been repeatedly put 
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to experimental test but the results and interpretations have thus far 
remained contradictory. 

In 1933 Hull (121) performed an experiment in which he was able 
to establish differential reactions to the same maze environment on the 
basis of different drives (hunger and thirst). He trained his rats in a 
simple two-route maze. The animals had to traverse one arm of the 
maze when they were hungry and were fed in the goal chamber. On 
days when they were thirsty, the animals had to traverse the other arm 
of the maze and received water in the same goal chamber. Hull's animals 
learned this discrimination only with great difficulty : 25 training periods 
of eight days each were needed before 80 percent correct responses 
were given on the first run of a series. Hull interpreted his results to 
show that animals can be conditioned to internal conditions (drive 
stimuli). The slowness and difficulty of the training would seem to 
run counter to the predictions of a sign-gestalt theory. The experiment 
was then repeated by Leeper (159) who fully realized its crucial signifi- 
cance for a field theory of learning. He introduced an important modifi- 
cation. In order to make a clear perceptual differentiation of the situa- 
tion possible for the subjects, he constructed a maze with two end-boxes, 
one containing food and the other water. When a rat made an incorrect 
choice, it entered the goal chamber which contained the reward not 
desired under its prevailing motivation. Thus the rat had continua 
opportunity to build up an expectation of “what leads to what.” 
Whereas in Hull’s experiment the rat was simply blocked on an incorrect 
trial, Leeper’s subject acquired information about means-end relations 
on every trial, correct and incorrect. As a result Leeper’s rats required 
only one eight-day period to reach approximately the same criterion 
as Hull’s rats after 25 eight-day periods.” Leeper felt that his results 
fully support Tolman’s theory and the distinction between acquisition 
and utilization of habits. 

The results of Hull’s and Leeper’s experiment are reviewed in some 
detail to show how virtually the same experiment lends itself to alterna- 
tive interpretations, one based on the law of effect and assuming sum- 
mation of reinforcement, the other making learning independent of 
effect. Subsequent studies of the same problem have remained equally 
inconclusive Kendler (146) raised the question whether results similar 
to Hull’s and Leeper's could be obtained if both hunger and thirst 
drives were simultaneously present during the training period. He 
found that animals learned to respond appropriately on the test trials, 
accordance with the motivation prevailing during the test trials. 
Although this result would seem to be exactly in accord with Leeper’s 


21 Hull explained the difference between his and Leeper’s results by suggesting that 
after the first few trials Leeper’s animals probably operated under both drives, thus 
being rewarded no matter which compartment they entered (127). Ina reply to Hull, 
Leeper denied on the basis of his experimental records that such was the case (160). 


i.e., in 
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interpretation, Kendler offers alternative explanations within the frame- 
work of an effect theory. He speculates that (1) only those drive stimuli 
which are reduced during the training trial are associated with the re- 
warded response, and (2) invokes anticipatory goal reactions as possible 
differential cues. ee 

Recently Spence and Lippitt (249) have reported ‘‘an experimental 
test of the sign-gestalt theory of trial and error” which is again con- 
cerned with the utilization of alternative habits under different motiva- 
tions. The subjects (white rats) in this experiment were motivated by 
thirst and given 12 days experience in a simple Y-maze. One arm of the 
maze always led to water. The other arm led to food for one-half of the 
subjects, to an empty goal box for the other half. The test trials were 
run under hunger motivation, with the thirst drive satiated. During 
these test trials the subjects continued to go down the alley leading to 
water and the group which had previously experienced food was not 
superior to the other group in learning to choose the food alley. These 
results are clearly contrary to what would be expected in terms of 
sign-gestalt theory. However, the same authors had previously re- 
ported (in abstract form) an experiment which was more in accord with 
Tolman’s theory (248). In this case, the subjects were satiated for 
both food and water during the training series and found water at one 
end of the maze and food at the other. When made hungry or thirsty, 
the animals chose the alley leading to that goal which satisfied the need 
prevalant at the moment. At that time Spence and Lippitt concluded, 
“Latent learning does not occur in the situation where animals perceive 
the subsequent goal object while motivated for another, but latent learn- 
ing does occur where complete satiation made for no particular goal- 
directedness.”” Thus the results of Spence and Lippitt do not call 
necessarily for abandonment of a sign-gestalt interpretation but for a 
Modification : experience with goal objects which satisfy a need and with 
goal objects for which there is no need during the training period are 
Not equally effective in learning (acquisition of expectations). ; , 

Just as the facts and interpretations of latent learning remain under 
discussion, the general problem of the mechanism of discrimination 
learning is still the cause of serious disagreement between proponents 
and opponents of an S-R-effect theory of learning. The experimental 
fact, on which there is general agreement, is that subjects can learn to 
choose between positive stimuli and negative stimuli simultaneously 
Presented, i.e., between stimuli response to which leads to reward and 
stimuli response to which fails to lead to-reward or leads to punishment 
(150, 154, 156, 157, 170, 205, 239, 241, 245). Even before the discrimina- 
tion has been learned, the animal does not respond ina haphazard 
fashion but shows definite systematic response tendencies, or hypoth- 
eses” (149, 150, 151, 152). It is around these systematic er 
tendencies during the pre-solution period that the main argument e- 
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tween “continuity” and “non-continuity” theories has been centered. 
According to the continuity theory, discrimination learning, like other 
types of learning, results from a cumulative process of building up an 
association between the positive stimulus cue and the response. Every 
time a response to the positive stimulus cue occurs and is followed by 
reward, the association between cue and response is strengthened; 
every time a response to the negative stimulus cue is made and fails to 
be followed by reward, the tendency to respond to this cue is weakened. 
Discrimination is established when the difference in the excitatory 
strengths of the positive and negative cues is sufficiently great to over- 
come other aspects of the total stimulus situation which are not con- 
sistently associated with reward or failure. In terms of this analysis 
discrimination learning is fully explained in terms of stimulus-response 
association and the law of effect. The continuity hypothesis of dis- 
crimination learning has been sponsored by Spence (240, 241, 242, 243, 
244, 245), and McCulloch (168, 169, 170). Spence has shown that a 
derivation of “hypothesis” behavior during the pre-solution period is 
possible in the framework of the continuity theory (240) as is a deriva- 
tion of the special type of discrimination learning studied in “‘trans- 
position” experiments (242). According to the noncontinuity theory 
as it is interpreted today, the animal learning a discrimination changes 
from one systematic mode of response (hypothesis) to another until the 
problem is solved but practice on the unsuccessful hypothesis does not 
contribute to the learning of the correct association. This view has been 
defended primarily by Lashley (157, 158), Kreshvesky (153, 154), and 
Haire (97, 98, 99). 

Both theories have been supported by important experimental 
investigations. A survey of the evidence suggests that the argument is 
still in an inconclusive stage: The crucial experimental question is 
whether associations are formed during the pre-solution period which 
significantly influence the subsequent establishment of the discrimina- 
tion. In a typical investigation of this problem, the significance of the 
stimulus cues is reversed during the pre-solution period; the cue whic 
is to be positive on the test trials is made negative and vice versa. 
According to the continuity theory such reversal should slow down 
learning on the test trials, according to the non-continuity theory the 
reversal should have no effect. The first demonstration of the cumala- 
tive effect of training was presented by McCulloch and Pratt (170) 
in their study of weight discrimination by white rats. | Preliminary 
training to the lighter of two weights slowed up learning when the 

ht was made positive. In a study of visual form discrimina- 


ier weig a 
ge of chimpanzees Spence (241) similarly found that the 


establishment of a discrimination was directly dependent on the excita- 
tory serengths of the positive and negative stimuli. The greater the 
relative number of reinforcements a given stimulus had received in a 
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series of learning tasks, the easier it was to establish a positive response 
to that stimulus. The correlations obtained were positive, high and 
Significant. Spence also showed that sudden learning (insight) like 
gradual learning, was positively correlated with the excitatory strengths 
of the positive and negative stimuli as determined by the number of 
reinforcements (243). 

Not all the evidence has been on the side of the continuity theory, 
however, Using the technique of reversing the positive and negative 
stimuli during the pre-solution period, Kreshevsky (154) found that the. 
reversal did not significantly affect the learning of the correct solution. 
Lashley (157) after repeating Spence’s study of form discrimination 
with rats argued that evidence from positive correlations between 
number of reinforcements and ease of discrimination learning is incon- 
plasive, He considers the assumption unwarranted that a high corre- 
ation between two arrays cannot exist when the values of one are in 
Part determined by chance. In computing his own correlation Lashley 
emitted all trials showing systematic reaction to position, thereby 
Changing the excitatory values of the stimuli by random measures 
rom 0 to more than 100 percent, and yet the correlations between 
number of reinforcements and ease of discrimination learning were not 
reduced but slightly increased! Lashley then offered experimental 
Proof against the assumption that all stimuli acting at the time ofa 
response tend to be associated with that response. Rats who learned 
n respond in terms of size did not at the same time learn to respond in 
a ne of that shape which was consistently associated with the positive 
stin aa “If the animals are given a set to react to one aspect of a 
dae situation .. . large amounts of training do not establish asso- 

1on with other aspects, so long as the original set remains effective 
i Teaching the food” (157, p. 259). Instead, Lashley emphasizes the 
3 € of perceptual organization and attention in determining which 
Spects of the stimulus situation will be associated with the response. 

he controversy still continues. In 1945 Spence (245) published a 
carefully designed experiment in which he again used the technique of 
Cue reversal, Great care was taken to control all relevant factors in- 
a uding position habit. Cue reversal significantly retarded learning and 
> ae led Spence to conclude that the development of an association be- 
of een a cue and a response is a cumulative process which is independent 
per tematic response tendencies to other cues during the training 
Sg Discrimination depends on the number of times the animal has 

n rewarded in the presence of the positive cue. 

Snape spite of Spence’s impressive results, the last word has not been 
lan: o Recently Lashley and Wade (158) published a stringent criti- 
es of “neo-Pavlovianism.” They singled out for theoretical and 
ie imental attack two assumptions central to ‘‘neo-Pavloviaaism,” 
““ to modern effect theory: (1) that in conditioning all aspects of the 
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stimulus situation are associated with the reaction, and (2) that the 
effects of reinforcement spread to stimuli other than those present during 
training (stimulus generalization). Lashley and Wade believe these 
principles to be contrary to fact. In their experiments, groups of sub- 
jects (rats, monkeys) were trained to react positively toa given stimulus. 
The stimulus would then be opposed in a discrimination experiment to 
another on the same stimulus dimension. Is some cases, the reaction 
to the initial stimulus was reinforced, i.e., the initial stimulus remained 
positive; in other cases, the reaction to the initially positive stimulus 
was extinguished, i.e., it was made negative in the differential training. 
The rates at which discriminations could be established under these 
two conditions were compared. In every case differential training was 
faster when the initial reaction was extinguished than when it was 
reinforced! The differences were consistent though not statistically 
reliable. On the basis of these results Lashley and Wade reassert that 
in discrimination relatively few, often not more than one, aspects of the 
stimulus are effective in the choice reaction of the animal. ‘‘A definite 
attribute of the stimulus is ‘abstracted’ and forms the basis of reaction; 
other attributes are not sensed at all or disregarded” (158, p. 81). 
Stimulus generalization is ascribed to failure of association. When a 
subject responds to a stimulus to which he was not originally trained, 
he does so because of his failure to attend to those characteristics which 
distinguish the training stimulus from the stimulus to which the reac- 
tion is generalized. On the other hand, a subject establishes a differ- 
entiation when he forms associations with such aspects of the stimuli 
as he had not attended to in earlier stages of his training. The burden 
of the argument is a denial of automatic irradiation and indiscriminate 
association of stimuli with responses by virtue of sheer temporal proxim- 
ity to a reinforcing state of affairs. 

The crucial conceptual distinction which the field-theoretical ap- 
proach has introduced is between learning and performance. Though 
recognizing the logical status of this distinction McGeoch seriously 
questioned its experimental usefulness. ‘‘The only way we can know 
that learning has occurred is by an observation of successive perform- 
ances since learning is a relation between successive performances. 

_,. Assertions that motive and effect influence performance but not 
learning become meaningless in the absence of quantitative demonstra- 
tion, a demonstration which cannot be made without measurements of 
performance” (171, p. 599). Replying to this criticism, Leeper (160) 
pointed out that different test conditions may result in very different 
performances following the same learning situation. In strict conform- 
ance with McGeoch’s view one would then have to conclude ter 
any learning situation results in a large number of “‘learnings.’ A muc 
more parsimonious approach consists in the use of such distinct inter- 
vening variables as learning and performance, or, in Hull’s language, 
habit-formation and habit-evocation. 
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Another difficulty which results from the distinction between learn- 
ing and performance is the conceptual gap which is left between knowl- 
edge and action, If learning consists in the building-up of expectations, 
how is expectation translated into action, and how can the specific 
character of action be predicted? The field theory predicts what a sub- 
ject will come to expect but it fails to predict in any specific way what 
he will do as a result of the expectation. Guthrie, who has long been 
concerned with the analysis and prediction of particular movements, 
has expressed this criticisth as follows: “Signs, in Tolman’s theory, . 
occasion in the rat realization, or cognition, or judgment, or hypotheses, 
or abstraction, but they do not occasion action” (90, p. 172). Hilgard and 
Marquis (108) regard this failure to predict specific action as a weakness 
but also see in it a source of strength. A breadth of interpretation is 
Possible which strict conditioning theories do not allow. A variety of 
Performances can be grouped together in terms of the purpose which 
they serve without regard to particular details of movement. 


Rejection of the Law of Effect as a Basic Principle of Learning or 
Performance 


It is his insistence on an analysis of the learning process in terms 

of specific stimuli and specific response movements which has led 
Guthrie (88, 89, 90, 91, 92, 93, 94) to the formulation of a theory in 
which the law of effect has no place as either a basic principle of learn- 
Mg or of performance. Guthrie distinguishes strictly between acts 
and movements. An act is a class of movements defined by the end 
result. The law of effect with its emphasis on the consequences of S-R 
it pnections thus applies primarily to acts. In Guthrie’s view, however, 
: fe to movements and not acts that the basic laws of learning must 
i er. The achievement of an act, i.e., of an effect or end result, is “com- 
bietely dependent on the acquisition, through learning, of a specific 
stereotyped movement or set of movements for the accomplishment of 
the effect that defines the act” (94, p. 51). As to the association of 
ponuli and response movements, it is entirely explained in terms of one 
asic principle—association by contiguity. It is always the last move- 
Paid or set of movements made simultaneously with a given stimulus 
D at is repeated on recurrence of the stimulus. Only a single coincidence 
5 Stimulus and response is necessary to establish the association. Thus 
tae is accomplished in one trial but so is unlearning, for one presen- 
as ‘on of the stimulus on which the response is not made destroys the 
SOciation, Although all learning is complete in one trial, the process 
te Acquisition proceeds only gradually because of the continuously 
whens nature of the components of the stimulus situation. Only 
cont the response movements have been conditioned to the various 
©Mponents of the stimulus situation—exteroceptive, proprioceptive, 

interoceptive—can the response be reliably evoked. 
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Guthrie’s description of the basic processes in learning is made inde- 
pendently of the action of rewards and punishments. The practical 
efficacy of reward and punishment can be satisfactorily accounted for 
in terms of stimulus-response contiguity. It is always the response last 
associated with a stimulus which will be repeated when the stimulus 
recurs. A successful response either removes the organism from the 
stimulus situation (for example, by allowing escape from a puzzle-box) 
or so alters it (for example by removal of internal drive-stimuli through 

_ feeding) that no new associations with the situation can be formed. 
Similarly the effectiveness of punishment depends on what it makes the 
organism do in the presence of a stimulus (for example, make with- 
drawal movements), not on what it makes him feel. It is the function 
of reinforcing agents to protect the associations made. Effect prevents 
unlearning and allows the law of recency to operate. 

Guthrie’s explanation of effect in terms of changed stimulus context 
has been criticized on the ground that rewards may be effective even 
though they do not clearly alter the stimulus situation (169). Eating 
a small pellet of food, for example, does not remove the drive stimul.s 
and yet is effective in strengthening an S-R connection. Guthrie be- 
lieves that even with a small reward the stimulus situation is materially 
altered since the ‘“annoyance’’—the restless, excited behavior which 
precedes feeding—is removed though the “annoyer” (the drive stimuli 
maintaining the animal’s search for food) persists. Removal of annoy- 
ance removes the cues for successful action and thus prevents unlearning 
(92). This reformulation is not entirely conclusive. After eating, the 
animal may soon again be in an aroused state, the annoyance may re- 
turn, and previous responses would be expected to be unlearned. “It 
is obvious that something has changed when the animal ceases to run 
or to push levers and begins to eat, but a theory must state more pre- 
cisely just what the change is which guarantees the learning of the prior 
response if the theory is to be verified experimentally” (108, p- 92). 

As to experimental verification of the theory, the most outstanding 
work is that of Guthrie and Horton (95) who reported a detailed, wel 
documented study of cats’ behavior in a puzzle box. The filmed records 
of the animals learning to escape from a puzzle box are well in accor 
with Guthrie’s theoretical expectations. The most striking characteris- 
tic of the subjects’ behavior was its repetitiousness. The movements 
immediately preceding escape were distinguished by their stereotypy 
since removal from the situation presumably protected the last associa- 
tion formed. Whatever variability of behavior appeared is explained by 
changes in the stimulus situations which caused new associations to be 
established. 

More recently, an experimental study based on Guthrie’s theory 
was reported by Seward (230) who was interested in testing the “finality 
theory of reinforcement.” He divided his subjects (rats) into two 
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groups. One group was given food upon pressing a bar, the other group 
was removed from the experimental situation when it had made the 
response. Though both groups learned to press the bar, the reward 
group was clearly superior to the removal group. Though removal from 
the situation protects the last association, the total effectiveness of 
reward cannot be ascribed to the termination of the situation. 

Stimulus-response contiguity theory has been criticized as not easily 
lending itself to experimental test (108, 246). The continually changing 
stimulus elements, especially the subjects’ own proprioceptive impulses, 
are not readily controlled and manipulated. Usually changes in the 
stimulus situation have to be inferred from the failure of a response to 
be repeated. Such inferences are, of course, circular. The main diffi- 
culty lies in the highly specific nature of the stimuli and responses in 
terms of which the analysis is made. The theory is distinguished by logi- 
cal and consistent formulation; it exemplifies a coherent account of 
learning developed without recourse to a law of effect. But it is still 
greatly in need of experimental verification. 


Tur ROLE OF EFFECT IN COMPLEX LEARNING 

Our survey has shown the pivotal role of the principle of effect in 
the construction of a systematic theory of learning. Experimental tests 
of the various theoretical propositions have been conducted almost 
exclusively in the animal laboratory (Hull, Tolman, Guthrie) or with hu- 
man subjects in simple rote-learning situations (Thorndike). “Proofs” 
and “disproofs” of the law of effect in such experiments still leave 
Open the question of what role effect (satisfaction) plays in more complex 
types of learning. The question is sharpened by two types of observa- 
tion: on the one hand much learning occurs in the absence of demon- 
Strable drive reduction, and on the other hand satisfying responses 
sometimes fail to be repeated (3). The challenge of complex adult learn- 
ing which so often seems to defy the principle of effect, can be met in 
two ways. It is possible, with Mowrer (185), to defend the law of effect 
as a universal principle of learning despite the apparent failure of many 
learners to obey it—by reformulating the law of effect so as to encom- 
Pass these more complex forms of learning. On the other hand, one 
May-assert, as Allport has done (3, 4), that the law of effect holds only 
for animals, small children, mental defectives and in some peripheral 
Phases of adult learning, whereas different principles, such as ego- 
involvement and active participation, need to be invoked in the analysis 
of more complex adult learning. Thus, for Allport the law of effect is 
Not a law of learning but merely one of the many conditions which may 
favor learning—a condition, moreover, which applies only to a, very 
circumscribed segment of learning behavior. The controversy between 
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these two views is still in progress and is well exemplified by the recent 
symposium on “The Ego and the Law of Effect.” 

In attempting to extend the law of effect to learning which is not 
motivated by physiological need reduction the concept of derived or 
secondary drive plays a central role. A secondary drive is a learned drive. 
“These secondary drives are acquired on the basis of primary drives, 
represent elaborations of them, and serve as a facade behind which the 
functions of the underlying innate drives are hidden” (177, p. 19). 
' Under the heading of derived drives come anxiety and anger and such 
social needs as pride, ambition, and desire for social approval. Miller 
and Dollard (177) suggest that self-induced stimulation is the basis 
of acquired drives as well as the basis of acquired rewards and purposes. 
“Most of the responses which are the basis of socially significant ac- 
quired drives and acquired rewards are internal responses . . . "fp 58) s 
In the course of learning, any stimulus cue which acquires the ability 
to reduce a secondary drive acquires reward value. 

Secondary drives are learned but they may also serve as the basis 
for further learning since the reduction of a secondary drive is a reward 
which may reinforce S-R connections in accordance with the law of 
effect just as does the reduction of a physiological drive. By postulation 
of such derived drives the gap between animal learning and social learn- 
ing may be bridged and analysis of the latter in terms of stimulus, re- 
sponse, and effect (tension reduction) is made possible. Miller and Dol- 
lard’s book, Social Learning and Imitation (177) represents an attempt 
to apply this analysis to such social behavior as imitation, leadership 
and crowd action. May (175) has attacked the problem of war and peace 
in a similar conceptual framework, and Whiting (316) has applie 
stimulus-response-effect analysis to the process of socialization in & 
primitive society. In terms of such analyses, a complex network o 
acquired drives develops out of the primary biological drives, and it 1$ 
through the reduction of these acquired drives that the process of sociali- 
zation operates. When adult learning seems to proceed without obvious 
drive reduction we must search for the derived drives whose satisfaction 
makes learning possible. , 

The concept of derived drive has been experimentally fruitful. 4 
series of experiments conducted over the last ten years or so by Mowrer 
is a notable demonstration of the extent to which this concept has PFO” 
duced testable hypotheses. Mowrer has made a determined effort tO 
explore the implications of the law of effect to their limits and to ant 
upon.the foundations of this law a unitary account of learning whic 


22 Psychol. Rev., 1946, 53, No. 6 (4, 185, 221). 
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would have equal applicability to animal learning and to complex 
social learning. The concept of derived drive appears to be the touch- 
Stone of this theoretical edifice. Mowrer has demonstrated, for example, 
that preparatory set or expectancy may function as a drive whose reduc- 
tion serves as a motivating factor in learning (180, 183, 190). A state 
of tension or discomfort arises not only from the presence of a basic 
organic need but also from the anticipation of the recurrence of one or 
more of these needs. Not only does the application of an electric shock 
result in tension but also the anticipation of the shock, and this antici- 
Pation constitutes a derived drive. This state of tension is high before 
the occurrence of the punishment and low immediately thereafter (183). 
Reduction of this anticipatory tension is rewarding. ‘‘Other things 
being equal, the greater the extent of the drop in the expectancy-tension 
after the occurrence of a stimulus-response sequence the greater the 
reinforcing or learning-inducing value of this drop” (183, p. 38). 
Mowrer’s stimulus-response analysis of anxiety (181, 182, 190) and 
fear (189) is closely related to his conception of expectancy. Anxiety is 
the (learned) anticipation of a noxious stimulus. This anticipation, 
which is a source of tension and discomfort and hence acquires drive 
quality, results in a variety of acts from which are selected and fixated 
(by the law of effect) those forms of behavior which are most instru- 
mental in the reduction of anxiety. The postulation of anxiety and 
anxiety-reduction explains the seemingly paradoxical finding that a 
Conditioning procedure which permits avoidance of a shock results in 
better conditioning than a procedure which merely allows escape from 
the shock or mitigation of the shock (17, 190). Avoidance results in 
anxiety-reduction which provides reinforcement even though the noxi- 
Ous stimulus itself is not delivered. The effectiveness of avoidance and 
Concomitant anxiety reduction depends in part on the temporal se- 
quence of the noxious stimuli. If shocks are presented at regular tem- 
Poral intervals, better conditioning results than with irregularly spaced 
Presentation. When stimuli come in a regular order, anxiety mounts to 
a maximum as the time approaches for presentation of the stimulus, then 
Tops: With irregular presentation, the subjects are “kept in a more or 
ess chronic state of apprehension or suspense” (182, p. 510). Mowrer 
and Lamoreaux have also been able to show that in avoidance con- 
ditioning, the CS becomes a source of anxiety whose termination is 
ewarding in itself independently of the reinforcement provided by 
avoidance of the shock (189). As a result, conditioning is better if the 
(source of anxiety) terminates at the moment when the conditioned 
"sponse occurs than if it ceases either before or after the response. The 
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fact that the CS in avoidance training becomes itself a source of anxiety 
and thereby a motivating factor in its own right helps to establish con- 
ditioned avoidance responses which are radically different from the 
responses made to the unconditioned stimulus. Thus some of Mowrer 
and Lamoreaux’s rats were trained to jump in order to terminate the 
CS whereas the “correct” response to the UcS was running, and vice 
versa. Mowrer and Lamoreaux conclude that there are two distinct 
sources of reinforcement in avoidance conditioning, the one tending to 
strengthen the connection between fear and whatever response reduces 
the fear and the other which strengthens the connection between fear 
and whatever response eliminates the situation (shock and fear com- 
bined). They believe that the recognition of fear as a secondary motive 
in learning has important systematic.implications for the law of effect. 
“In this way so-called anticipatory or ‘foresightful’ behavior can be 
made to conform very acceptably to the requirements of the Law of 
Effect, and seemingly ‘purposive’ or ‘teleological,’ responses are ac- 
counted for well within the framework of scientific causation” (189, 
p. 48). 
Even though such intervening variables (secondary drives) as an- 
ticipation, anxiety, and fear” help to subsume under the law of effect 
behavior which is seemingly foresightful or purposive, a vexatious prob- 
lem remains. Why do organisms so frequently persist in behavior which 
is clearly punishing, or, at least, more punishing than rewarding? It 
is, of course, possible to argue that whatever behavior subjects persist 
in must somehow be rewarding or it would be abandoned. Such reason- 
ing would, however, be clearly circular and would beg the very question 
at issue in discussions of the law of effect. Recently Mowrer and Ullman 
(192) have attacked this problem of “non-integrative” learning. They 
believe that the key to the riddle of non-integrative learning lies in the 
temporal pattern of rewards and punishments. Many acts have con- 
sequences which are both rewarding and punishing. According to the 
gradient of reinforcement, those consequences which immediately follow 
the act will be more influential in learning than temporally remote 
effects. Integrative learning occurs when remote consequences are 
through symbolic behavior brought into the psychological present 
Non-integrative learning takes place when the organism fails to mak 
the symbolic bridge to the future and remains at the mercy of the im- 
23 Another derived drive is fatigue. In extinction, fatigue is the motivation competing 
with the performance of the act. Not to respond reduces the fatigue and is therefore 
rewarding. This analysis is supported experimentally by the fact that effortfulness ° 
task is inversely related to the number of extinction responses. Thus the process of ex- 
tinction is explained in terms of effect of need reduction (186). 
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mediate gradient of reinforcement. Itis the failure to react to remote 
punishing consequences which is the basis for the persistence of behavior 
in the face of punishment. This analysis holds well for the behavior of 
Mowrer and Ullman’s rats who, indeed, showed only a very limited 
capacity to learn in terms of temporally remote consequences. But 
what about non-integrative learning in humans capable of reactions to 
symbols? In face of this difficulty the authors unfortunately fall back 


on a question-begging argument: “The fact that certain habits or l 


‘traits of character’ may persist in the face of consistent punishment, 
raises a more difficult problem. One possibility is that punishment, 
which is observed, is offset by self-administered reward, which is not 
observed” (192, p. 85). But on this supposition the law of effect is in- 
capable of disproof since whatever learning occurs is ipso facio con- 
sidered the result of reward administered either visibly by others or 
invisibly and (often unobservably) by the self. Mowrer and Ullman 
Probably sense this difficulty since they admit in the same paper that 
“there may be more to the problem than this and that the ‘strength of 
the total ego’ seems capable in ways which have not yet been clearly 
analyzed, of being mobilized in support of any single part (habit) for 
Which the going is particularly hard” (p. 85). 

In the symposium already referred to, Mowrer (185) has further 
Clarified his view of the relation between the law of effect and “ego- 
Processes.” He does not believe that the type of learning which is 
described as “ego-involved” poses problems which the law of effect 
cannot answer. For the term “ego-involvement” he would substitute 
the term “interest”; for “interest” he would in turn substitute “emo- 
tional arousal”; and emotional arousal (e.g., fear) is a derived drive 
whose satisfaction is not in principle different from the reduction of 

© so-called primary, or biological drives. By such successive reductions 
, OWrer achieves the unification of all motives which may be effective 
Be learning and staunchly defends the law of effect as the sole true law 
© learning. “Learning occurs when and only when a drive is reduced, 
8 Problem solved, a satisfaction derived, but . . . this satisfaction may 
Stem rom the reduction of either a primary or secondary drive (185, 
Bi 332). Te is the mediation of effect through symbolic processes, 


*In his most recent statement Professor Mowrer (185a) has reversed his eae 
° Some extent. He now admits that the law of effect has failed as a universal mee ic 
principle of learning and suggests that it may be neat to assume ee 
Cesses: es solutions to problems, i.e., ordin: y 3 
Required ang e ees emotional learning or “conditioning ben eos 
sabit formation or en solving is mediated by the law of effect whereas con iss i 
= Emotiona] nite is conceptualized as a process of “association” independe: 
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especially self-administered rewards, which distinguishes ego-involved 
learning from learning which is reinforced by the reduction of physio- 
logical needs. 

An attempt rather similar to Mowrer’s, though less orthodox in 
formulation, to bridge the gap between ego-psychology and the law 
of effect was made by Rice (221). Rice admits that the law cannot be 
held if it means that success or satisfaction leads to (a) a perpetuation 
of specific responses and/or (b) the repeated choice of the same specific 
goal object: When normal adults repeat one of these two elements, 
it is usually with a variation in the other, or both response and goal 
are varied. But even though either specific response sequence or goal 
or both may be varied, it may still be true that something about the 
activity is repeated, such as the “interest” that is involved. Rice sug- 
gests a reformulation of the law of effect. Success or satisfaction does 
not stamp in a specific stimulus-response connection but it does con- 
firm the learner's interest in the general range of problems in which he 
has been successful. To the old question, what is it that success ‘‘stamPS 
in,” Rice then offers the answer that it is “interest.” Interests them- 
selves, even the processes to which the term ego refers, are acquired and 
perpetuated through the operation of this modified law of effect. Like 
Mowrer, Rice believes that symbolic processes (self-administered re- 
wards) play an important role in the mediation of the effect. ‘That 
core of the act which constitutes the ‘interest’ is the feature of it which 
is most likely to be symbolized and repeatedly confirmed through ap- 
proval of its symbol” (221 p. 316). 

The papers of Mowrer and Rice make it clear that it is logically 
possible to reformulate the law of effect and to describe its operation in 
such a way as to make it universally applicable to animal and ego- 
involved” learning alike. But as Allport’s reply (4) to these papers 
shows, the psychologist who is primarily concerned with the analysis 
of ego-processes cannot be satisfied by such reductions and reformula- 
tions. Allport rejects the reduction of ego-processes to emotional arousa 
(secondary drives). Only certain emotional states are ego-involved an 
the two states should not be considered identical. Indeed, it may be 
the lack of emotional arousal which characterizes the smooth function- 
ing of some ego-processes. Any law of learning based on “reduction © 


mediated by 
tonomic 
allotte' 


pleasure or tension reduction. Mowrer also suggests that effect learning is 
the central nervous system whereas emotional learning is carried on by the au 

s system. This reformulation considerably curtails the sphere of operation a 
a of effect. It remains to be seen whether this particular dichotomization wi 
ey petler than previous attempts at mutually exclusive classifications of the 


learning process. 
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emotional tension” can not indiscriminately be applied to learning that 
Proceeds from ego-interests. Allport staunchly maintains that the role 
of ego-processes is “irreducible.” There are other basic phenomena of 
learning for which effect theory cannot account satisfactorily. The 
favorable influence of motor activity and active participation resists 
reduction to pleasure or satisfaction. Above all, the ubiquitous direc- 
tive action of interests must be taken into account in any comprehensive 
theory of learning; “. . . learning proceeds because it is relevent to an _ 
interest system: it adds to knowledge, it differentiates items within the 
System, it broadens the range of equivalent stimuli... . Pleasure at- 
tending a single response, or even concatenations of response, is not 
€cisive” (4, p. 346). The capacity to relate environmental events to 
one’s interest system, to discriminate between relevant and irrelevant 
Means, far transcends the operation of effect, however broadly con- 
Celved, 
In the light of such considerations Allport considers the law of effect 
asa Secondary principle in learning. He believes that satisfaction plays 
a decisive role only in those organisms and those situations in which 
°80-Drocesses are not involved. In less mechanical forms of learning 
Satisfaction loses importance and can be maintained only as a question- 
egging concept. The symbols which are invoked as mediators of self- 
reward are “Vague molecular constructs that taper off into a kind of 
aimless triviality so far as explanatory power is concerned” (4, p. 344). 
ae Satisfaction or success serves as an indicator to the learner of 
well he is adjusting to his problem. But though satisfaction helps 
healed perceive the situation he uses this indication ina variable 
ner according to the pattern of interests that comprise his ego- 
Structure, 
CONCLUSIONS 


In the course of this survey we have touched on learning behavior 
s ging from conditioned salivation to the development of interest 
SYstems in mature adults. To all these types of learning the law of 
ect has been applied by some, only to be rejected by others. In spite 

Š an ever-growing volume of experimental and theoretical papers, 
A Peten still seems to be a long way off. The golden jubilee of the 
of effect will be celebrated in the not-too-distant future, but some 
€ basic issues for learning theory which the law of effect had hoped 


o P 
answer still remain: 


isi What is the agent responsible for reinforcement? Js it association by con- 


Sati a an OK reaction; the reduction of tension, primary or derived; the 
Staction of an interest; the confirmation of a cognitive expectation; the 
Val of the ego? 
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2. What is it that is reinforced? Is i 
perceptual organization; an interest system; the ego? 

3. What is the basic mechanism of reinforcement? 
synaptic resistances; cortical irradiation; or ar 


t a neural bond; an S-R connection; a 


Is it the lowering of 
e the physiological processes 


which will find ultimate acceptance still unnamed? 


Perhaps there is no one right answer among these alternatives. 
Probably more than one of these principles and additional ones that 
remain to be discovered will be needed in an integrated theory of learn- 


ing. It is safe to say that 
of effect as a monistic princip 
stantiated. As one of the behavi 
gainsaid. 


at the present state of our knowledge the law 
le explaining all learning has not been sub- 
oral facts of learning, it cannot be 
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DISCUSSION OF SNYDER’S “THE PRESENT STATUS OF 
PSYCHOTHERAPEUTIC COUNSELING’* 


J. L. MORENO 
Beacon, N. Y. 


Psychiatrists have frequently claimed that psychologists are not 
“Gt” to practice psychotherapy, but their attitude is changing in recent 
years, Psychologists, in turn, often assert that they are superior to psy- 
chiatrists in the use of scientific methods. However, an objective ap- 
proach and understanding—from both sides of the fence—is of the 
utmost importance if we are ever to attain a united front among the 
psychotherapists. By articles like the one of Snyder the existing chaos 
in psychotherapeutic counseling will be increased and the cleavage 
between psychiatrists and psychologists further widened.** The author 
frequently stresses that psychiatrists are commonly inclined to be un- 
scientific, subjective and intuitive in approach, whereas psychologists 
believe in experimental methods and scientific validation; that psy- 
chiatrists indulge in esoteric terminology, but that psychologists use 
terms which are universal, that is, accepted everywhere by the scientific 
fraternity. Such bias, which runs intermittently throughout the review 
comes to a head in the statements directed against Freud and Moreno- 
In the chapter on psychoanalysis he says: 

One difficulty for the nonpsychoanalytically trained reader is that the 
vocabulary of psychoanalysis is so esoteric that it frequently has little meaning. 
_,. the majority of psychiatrists in this country and almost every psychologist 
are more likely to agree with Blanchette (?) that psychoanalysis is a medica 
psychology without adequate roots in or connections with scientific medicine 


or scientific psychology. 


In the chapter on psychodrama he says: 


d intuitive in approach. ... Moreno pos 


Moreno himself is unscientific an 
terms 


tulates relationships and behavior patterns which are explainable only in 
of his esoteric system. In this regard his method is similar to Freudian psy- 
choanalysis (p- 333). 
s common knowledge that psychoanalysis is rooted in French 
the school of Charot. If it added some original ideas to 
this in itself should not make it unscientific. T° 
ychol. 
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treat Freud in an article on therapeutic counseling as if he would be a 
dead dog is ridiculous. He was a great empiricist, he formulated his 
hypotheses with care and discarded them when new observations seemed 
to warrant it. Because he was pre-experimental, this does not mean 
that he was unscientific; as the psychology of the individual was in its 
infancy when he approached it, he was as scientific as its stage of devel- 
opment permitted. Unfortunately, the words “scientific” and “esoteric” 
are used by Snyder in a moralizing, almost schoolmasterly way. All 
scientific terms have once been esoteric and many are now universally 
accepted. Should one hesitate to introduce new ideas and coin new 
terms because they are bound to be considered esoteric by ur contem- 
Poraries? It should not be permitted that fear of new ideas and resent- 
ment of originality raise their head under the guise of a narrow concept 
of science and of how new ideas emerge. 

The treatment which psychodrama and sociometry receive at the 
hand of Snyder is intuitive and esoteric, to use his own terminology. If 
he would have studied the subject more thoroughly he would have 
learned that the outstanding development in counseling in recent years 
's closely linked to sociometric and psychodramatic methods. It has 
Nothing to do with the controversy ‘directive-nondirective”’ which 
Psychiatrists have been carrying on for more than fifty years. In 

sociometric counseling” the counselor may be anonymous; the counse- 
lor is not only nondirective but may be altogether absent from the scene 
when the therapeutic choice process takes place. The sociogram, 
Mapping the emotional currents operating among the members of a 
Sroup, becomes the objective reference upon which his recommenda- 
tions as counselor are based. The process of involvement is in principle 
Temoved “from the counselor-client relationship to the client-client 
relationships. 

Psychodramatic counseling is more complex; practically all problems 
Which appear singly in each of the current psychotherapies reappear 
in a psychodramatic session magnified or in new combinations. Because 
there are many types of problems which the clients bring to us there 
are many versions of psychodrama. There has not been found a single 
Method which can be applied universally. The oldest and most popular 
orm of psychodrama is the self directed form, in which the self of the 
Client directs the production exclusively, director and auxiliary egos 
assisting him in the production if he wishes. In another version of 
Psychodrama the director is outside of the situation, the therapeutic 
Process takes place between the two or more clients involved in the 
Conflict, 
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There can be no greater contrast than the one between the non- 
directive interview and self-directed psychodrama production. Non- 
directive puts a premium on a minimum of self expression of the 
counselor; psychodrama puts a premium on the maximum of self ex- 
pression of the client. The word nondirective is often misleading. It 
is a negative term, implying that the spontancity of the client should be 
permitted to express itself without interference from any source. Non- 
directive writers talk often the language of spontaneity in reverse; they 
are worried about making the counselor as unstimulating to the client 
as possible but not worried about whether the subject is stimulated to 
be himself. Various interviewers have been observed in practice; it 
was found that, however cautious they may be in their questionings, On 
the action level they assumed various attitudes which appealed to the 
subject in uncontrollable ways. One counselor may have a fatherly way 
of taking the role, smiling approvingly; the other a businesslike way; 
the third may be withdrawn and indifferent; all three may use the same 
technique of questioning but they have a different effect upon the client. 
It was evident that the very physical appearance of the counselor was 
bound to influence the client; it has, as we sociometrists phrase it,a tele 
effect. 


Mr. Snyder refers to one of my papers “Scientific foundations of 
group psychotherapy.” He is unaware that this article is just a sum- 
mary, that it refers back to fifteen years of research in which several 
hundred workers have collaborated, contained in nine volumes of 
Sociometry, books like Moreno’s Who Shall Survive?, Jennings’ Leader- 
ship and Isolation, monographs like ‘‘Sociometric statistics of social 
configurations,” “The measurement of sociometric status, structure an 
development,” etc. The climax of ignorance for the meaning of psy- 
chodrama and group psychotherapy comes forth in Snyder’s statement: 
“In evaluating psychodrama it is perhaps of significance to note that 
Moreno himself performs for the public at two weekly sessions in New 
York where admission is charged” (p. 334).* This is obviously 4 
derogatory remark. Snyder seems to believe that it is unworthy of 4 
psychotherapist, perhaps also unscientific as well as undignified, to 
give sessions ‘for the public.”* The fact is that the most revolu- 
tionary change in therapeutic counseling since psychoanalysis '$ 
linked with open sessions for the public. My giving of open sessions 
ar, one session 


e for the entire 
idents 


* At the New York Psychodramatic Institute a ten weeks’ semin: 
weekly, is given by me as is customary for experts in a specialty. The fe 
seminar is-$15.00. The fee for a single admission is $1.65. The public consists of stu 
of the universities and colleges in and around New York. 
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had enormous consequences because it broke with two sacred cliches, 
first that of the counselor being closeted in a secret chamber with the 
client and no other client or counselor sharing in the experiences evoked 
in the session; it was and had to remain strictly confidential. The 
public session opened the door wide for the participation of larger and 
smaller groups in a common therapeutic experience. Second, not making 
any record during the session which was to remain sacred and discreet 
for the sake of the client. But once the privacy of the one-to-one 
therapeutic relationship was broken it became a foregone conclusion 
that simultaneous observation and recording of the therapeutic proc- 
ess should be permissible. This marked the way towards the objecti- ~ 
fication of the interview. The clinical advance inaugurated by the public 
Session meant automatically also an advance in objectively recording 
and analyzing its operations. The principle of electric recording taking 
Place simultaneously with the therapeutic process was announced by 
me seventeen years ago and followed since then throughout my writings 
and frequently practiced—it is curious therefore that Snyder should give 
the nondirective counselor priority in this: 3 
A Reactions witnessed by the psychologist and revelations given by any in- 
ividual during a course of interviews, casual or planned, are, at least from 
the point of view of cooperative, controllable research, of little value since they 
are after the event merely memory impressions of the observer. The multiform 
interpretations offered by the subjectivists in psychology are without proper 
©monstration and reconsideration as long as they do not conserve the moment.’ 
t was suggested frequently therefore that “a talking machine should photo- 
graph the process” and that we should make systematic use of this machinery 
Ot Personality recording.* 


Psychodramatic procedure, being an offshoot of sociometry, has to 

© viewed’ and evaluated with sociometric methodology and experi- 
mentation as a background. Terms like sociodynamic effect, social atom 
and tele can appear esoteric and intuitive only to one who has not ac- 
Wuainted himself with the methods by which they have been explored. 


N is Moreno, J. L. Application of the group method to classification. New York: The 
ati 


i onal Committee on Prisons and Prison Labor, 1931, p. 16. (Current edition: Group 
ethod and group psychotherapy. New York: Beacon House, 1946, p. 16.) 
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LANDIS, Carney, & BoLLEs, M. MARJORIE. Textbook of abnormal 
psychology. New York: Macmillan, 1946. Pp. xii+576. 


This is, for me, the best book of its kind. Iam using it in both under- 
graduate and graduate courses in Abnormal Psychology. It is compre- 
hensive and scholarly. It is a serious text, not the chatty type; though 
clearly written, it has the solid character of‘an excellent technical sum- 
mary. 

It is an eclectic text. No single system of thought in psychiatry oF 
psychology is preferred. Perhaps eclecticism is the only tenable position 
in the field at the moment, but such an approach is bound to seem 
desultory and, at times, confusing. I would not use the book as a text if 
I could not myself present a dynamic view of human personality to 
supplement the too-neutral picture here given. If an individual lecturer 
does not feel up to giving such a picture on his own account, perhaps he 
should use the Landis and Bolles text in conjunction with some other 
book which emphasizes a more dynamic ontlook. A tremendous ad- 
vantage of Landis and Bolles lies in scope, in the great amount of thor- 
oughly metabolized information which is presented. 

Landis and Bolles are much impressed by the twin data and by the 
data on inheritance of schizophrenia. I am still stubbornly uncon- 
vinced. Kallmann’s study on The Genetics of Schizophrenia scems to 
me full of assumptions which need to be combed out by an expert 1n 
statistical design. Some ingenious skeptic with good training should 
attack this problem. If Kallmann’s method should prove to be fallible 
we could spare ourselves much confusion by learning this at an early 
date. A single misguided but authoritative-seeming research can divert 
scientific activity from its proper goals for a considerable period. De- 
spite such historical studies as Kallmann’s, I shall remain in doubt about 
the inheritance of mental disorder until we have a science of learning 
applied to the problems of childhood and personal development. 

The organization of the text is novel and useful. Psychoses, neur 
and mental deficiencies are dealt with first. Then follow sections 0” 
heredity, cultural and personal development. Then the authors use 4? 
interesting device. They go back over the abnormal data from the 


oses, 


standpoint of the psychologist, presenting chapters on disorders 
sensation, speech, volition. In the case of speech disorders the age 


would be like this: In the early chapters concerning psychoses, 
slurred speech of the paretic or the labored vocalization of the depress? 
person will be duly noted. In the later chapter on speech, all speec 
deviations will be reviewed as a group and will be related to relevan™ 
psychological hypotheses and experimentation concerning speech deve E 
opment. These chapters give a specific content to the claim of the psy 
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chologist for a special competence in dealing with the phenomena of 
mental disorder. 

At the end of the text some omnibus chapters appear—on the rôle 
of the brain, legal psychiatry, and psychotherapy. The chapter on 
“The Law and Mental Abnormality” is a characteristic example of the 
thoroughness and general cultural interest of this book. 

I am sure that my own research preoccupations will never permit 
me to undertake the tremendous task of writing a text in Abnormal 
Psychology. If they would, however, I would like to imitate Landis 
and Bolles in certain respects. I wish my imagined text would show as 
much of hard work, effort at neutrality of presentation, and broad base 
of facts as does theirs. I am sure I would be appalled at the’condition- 
ality of our information in this field. Most writers show an entire ab- 
Sence of methodological sophistication. Here and there one finds per- 
centages cited, but the ideas of reliability of classification and validity 
of relationship among signs and events are woefully absent. The 
characteristic standards of the psychologist have never really been 

ought to bear on the field of Abnormal Psychology. The physicians 
Who are the major reporters of evidence in this field inherit data from 
the Pre-clinical biological sciences but rarely understand or use with 
Independent force the methods of these sciences. The writer in this field 
as to be a sort of one-man reviewing stand, attaching his own estimates 
of Probability to the data presented. } 
am sure my text would make a greater effort toward a behavioral- 
Psychoanalytic synthesis of life history materials. The lack of this in 
andis and Bolles is fairly plain, especially in the clinical histories, 
Which tend (as a result, in my opinion) to be dull and unilluminating. 
Would strugele to apply the “laws of learning” to the problem of child 
Socialization, and to find in childhood the deviant habits which, if 
emphasized, might become traits in the later constellation of a mental 
'sorder, } think this striving would drive me to write a more biassed 
an contentious book than these authors have achieved, and this 
Might well be a disservice to science. ; P 
t I think it would be worthwhile to put in such a text a reminder to 
the teacher that audio-visual aids can perform a useful rôle in a course 
normal Psychology. Some of the better films on particular topics 
t be referred to. It might also be noted that anyone with a sound- 
ding gadget of modern fidelity can easily record characteristic 
interviews with mental patients, and thus achieve the tremendous 
ue of the auditory stimuli in defining the neurosis or psychosis. 
wever, I need hardly remind the reader that the bird in the hand 
Provided by Landis and Bolles is worth a flock of these birds in the bush 
my imagination, 
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Joun DOLLARD. 
Institute of Human Relations, Yale University. 
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Horsey, J. S. Narco-analysis. London: Oxford Univ. Press, 1946. Pp. 
vii+134. 

“The purpose of this book is to describe a technique which (the 
author) devised in order to shorten the course of deep mental analysis 
and psychotherapy.” Since psychoanalysis, because of its long duration 
and expense, is not practical in many situations, a method is provided 
utilizing drugs which overcome the initial resistance of the patient by 
“loosening his psychic tension’ and thus facilitating the analysis of the 
patient’s mental content. y 

After a brief survey of the history of narco- analysis, the author dis- 
cusses the methods of hypnosis and indicates their basic shortcoming 
and the need for a method of inducing narcotic hypnosis. This is fol- 
lowed by a discussion of the chemical structure of the two different 
types of barbiturates—those characterized by anticonvulsant properties 
(containing the phenyl radical) and those characterized by hypnotic 
effects (containing only aliphatic substituent groups). He then proceeds 
to describe the application of sodium amytal to incommunicative schizo- 
phrenics, resulting in their return to normality for about two minutes 
during which they discussed their illness and future plans. This breaking 
through of the psychotic shell of the patient enables better contact 
later, so that study, diagnosis and even therapy can be introduced with 
the most recalcitrant of patients. This method is designated as the 
“drug-analytic”’ method in contrast with narco-analysis. The drug- 
analytic method consists essentially in giving intravenous injections 0 
a barbiturate to produce deep narcosis, and in utilizing the patient's 
‘subsequent remarks as a basis for making better contact with him. The 
narco-analytic method is essentially psychotherapy conducted under 
light narcosis. 

This chapter is followed by a section on narco-analysis ir 
hospital, out-patient clinic, and in wartime. . r 

The particular psychological mechanism in which narco-analys!§ 
operates is then discussed. Abreaction and transference are both and” 


Iyzed with reference to the proper role that narco-analysis may play 4 
js an 


na mental 


each. Illustrative case histories of schizophrenia, dream-analys 
simulation and malingering are given. 

In his final chapter the author points out that in choosing a method 
for therapy the choice will depend upon the type of patient, time avail- 
able, and inclination and experience of the physician. Hypno-analysi§ 
may be preferable in some cases, he says, since it does not depend upon 
the use of any drug or any physical means and the patient is fully com 
mprovement is due entirely to his own effort and not to 


scious that 11 3 ffor Be 
some mysterious agency with consequent absence of insight. For Te 


experienced physicians, however, the drug-analytic method is to i 
preferred and the experienced therapist can use it to bring about shor 
cuts in development of contact. Furthermore, the limited number g 
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cases to which hypno-analysis is applicable limits the usefulness of the 
method. Throughout the author stresses that the advantages of drug 
therapy itself would soon be lost if psycho-therapy were not introduced 
and utilized to its greatest extent. 

JOSEPH ZUBIN. 


Psychiatric Institute, Columbia University. 


Lowrey, Lawson G. Psychiatry for social workers. New York: Columbia 
Univ. Press, 1946. Pp. ix+317. tə 


The modern social worker is in need of knowledge of psychiatric 
concepts and psychotherapy. What knowledge to serve what purposes? 
Out of his extensive experiences in the child guidance work, Dr. Lowrey 
is in a position to know. His Psychiatry for Social Workers represents 
an effort to satisfy this need in the light of his experiences. The book 
Presupposes a familiarity with psychiatry by way of one course or the 
Teading of a single text-book in psychiatry. The author does not intend 
the volume to serve as a book on the origin and nature of human be- 

avior nor does he want the book to be a text in social psychiatry. What 
he does want is a volume that will serve social workers by furnishing 
them with knowledge of indicative symptoms of deviations in mental 
UNctioning with special reference to differential symptom pictures. The 
Consequent content selected includes a description of the major types 
B Psychoses, neuroses, mental deficiency, epilepsy, psychopathic 
Personalities, and behavior disorders (due both to internal pathology 
as well as to reactive behavior disorders). These are presented under 
relatively static chapter captions. Illustrative are: “The Data of Psy- 
chiatry,” “Clinical Combinations of Symptoms,” “The Convulsive 
'Sorders,”” “Psychopathic Personality,” “Schizophrenia,” and “Cy- 
Clothymic Psychoses.” And these topics are relatively well considered 
Or the purpose. f : 
any features that are not suggested by the captions are also in- 
cluded. ` Illustrative special features worthy of mention include: a 
telling use of admission rates to indicate the significance of the various 
Mental diseases: a concise treatment of delinquency; a clinical descrip- 
“on of the developmental stages without overdoing it; a consistently 
*alistic and dynamic consideration of most problems; a distinct bias 
al favor of breaking through the unreal and nonexistent barriers of 
f.emination and treatment; a critical attitude towards the examination 
Or examination's sake, history for history’s sake, and other empty 
8estures; and a distinctly favorable response towards the application 
of the differential approach. By this approach therapists proceed in 
te with needs revealed as the case itself unfolds, varying the tempo 
nd sequence as indicated. var “The Dat 
ofp Sychological examination is dealt with in the chapter on “The Data 
Sychiatry” and is mostly on the level of psychometric presentation, 
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as generally found in child guidance clinics directed by psychiatrists. 
The chief emphasis is on intelligence tests, performance tests, reading 
disability tests, and the Rorschach. Tests are barely mentioned and 
not too comprehensively considered. The Healy and Bronner manual, 
Terman’s Measurement of Intelligence, Wechsler’s Measurement of Adult 
Intelligence, two or three references on the Rorschach, and one reference 
on reading are all that psychologists would find on the strictly psy- 
chological phases of the examination. ‘The Data of Psychiatry” in- 
cludes anamnesis, social examination, outline of a consideration of 
race differences, physical examination, as well as psychometric examina- 
tions. 

All in all, problems considered are dealt with realistically as well as 
dynamically. In general, the book promises to satisfy the purpose for 
which it was written; namely, to serve as an intelligent guide for social 
workers. ` 

H. MELTZER. 
Psychological Service Center, St. Louis, Missouri. 


ARTHUR, GRACE. Tutoring as therapy. New York: The Commonwealth 
Fund, 1946. Pp. ix+125. 


In the current literature the role of the psychiatrist in solving school 
behavior problems has been given considerable stress. The role of the 
psychologist as a non-directive therapist has received recent attention. 
Dr. Arthur places a renewed emphasis on the therapeutic role of the 
psychologist as planner of remedial and individualized tutoring. Het 
book should be read by those interested in therapy techniques, child 
guidance programs or designs for curriculum flexibility to fit individual 
needs. h 
Dr. Arthur’s principal theme is that the right kind of tutoring: 
provided at the right time, can work wonders for a child’s adjustment 
at school, at home and in his social life. Examples of effective therapy 
are drawn from her experience as school psychologist and tutor, chi 
guidance clinic psychologist, and director of a W.P.A. tutoring project. 
Her discussion of finding tutors, remedial methods, program rearranging, 
etc., is generously illustrated with individual cases summarized wit 
affection and humor. Many of the cases are followed for the entire 
course of schooling. An evaluation is made not only of the immeciate 
improvement in tutored subjects but of the subtle, long-term psycholo- 
gical adjustments as well. : 
Perhaps the chief defect of the book is that it paints too rosy a P!° 
ture of tutoring therapy. Dr. Arthur emphasizes that good tutoring 
includes an extensive study of the child, a tutoring plan worked out by 
jenced psychologist, a tutor carefully chosen with the child $ 
hological needs in mind, plus frequent retests and prog- 
s is customary successful cases are cited. A more obs 
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jective tone could have been provided by including certain complete 
reports of children who did not respond adequately to the tutoring. 
That there were such children is obliquely suggested in the number of 
cautions which Dr. Arthur provides. For example: (1) the psychologist 
takes the sole responsibility for the tutoring plan, leaving the tutor free 
to side with the child; (2) tutoring programs should be tapered off 
rather than discontinued, etc. 
Altogether this book provides a timely emphasis, certain good diag- 
nostic insights, tutoring hints, and a wealth of illustrative cases. t 
ADELLA C. Youtz. 
Teachers College, Columbia University. 
WITMER, HELEN LELAND (Ed.). Psychiatric interviews with children. 
New York: Commonwealth Fund, 1946. Pp. x+443. 


In her introductory remarks the editor states that “the aim of this 
book is to show various ways in which psychiatrists in child guidance 
clinics utilize the therapist-patient relationship for therapeutic ends.” 
In each of ten case studies, the details of the successive interviews be- 
tween child and psychiatrist are set down according to a uniform plan. 
The form of presentation is similar to that used by Carl Rogers in the 
case reported in his Counseling and Psychotherapy but these are not 
verbatim reports. No notes were taken while the interviews were in 
progress but presumably each was written up shortly after its conclu- 
sion. 

The book is designed to illustrate the changing approach in handling 
children’s behavior problems which has been taking place during the 
Past ten or fifteen years. Before 1935, child guidance workers depended 
chiefly upon discovering and modifying those factors in the child’s 
environment that seemed to be primarily responsible for his difficulties. 
The weakness of this approach lay in the fact that in so many cases it 
Proved impossible to put the needed changes into effect. 

As psychoanalytic methods for children became more completely 
developed and with the increasing popularity of play therapy the point 
of view shifted. Attention became focussed upon the child rather than 
Upon his surroundings. Direct observation of his behavior by the psy- 
chiatrist himself was substituted for second-hand reports. An intimate 
and dynamic relationship between patient and therapist could then be 
Created in which the child would feel free to express his fears and his 
hatreds, his anxieties and his desires without fear of rebuke or punish- 
ment. The establishment of such a relationship requires time and pa- 
tience as well as skill. In the 10 examples given, the number of reported 
interviews rom 9 to 53. y 

The Take atl into three parts. Part I was written by the 
editor, It is entitled, Introductory Comments and contains three chap- 
ters. The first is largely historical, describing the conditions that led 
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up to the establishment of child guidance clinics and the changes in 
theory and procedure that have taken place since that time. The 
second chapter describes the kind of patients commonly accepted for 
treatment and the problems most often displayed. Chapter III is 
entitled, Relationship: the Dynamic of Therapy. It presents the modern 
theories underlying psychiatric work with children which, as indicated 
by the title, have one basic point in common. All demand the establish- 
ment of a feeling of security in the presence of the therapist such that the 
child becomes free to “be himself” without.fear of censure or loss of 
“face.” Different psychiatrists utilize this relationship in various ways 
but all recognize it to be essential for the success of the treatment. 

Part II which constitutes the main part of the book begins with an 
introductory chapter by the editor describing the criteria used in the 
selection of cases. These were as follows: (a) the psychiatrists making 
the reports should be known to the editor as having had thorough train- 
ing in child therapy and extensive experience in child guidance work, 
(b) only those cases should be included in which the treatment proved 
to be at least reasonably successful, (c) the cases selected should be 
fairly representative of the type commonly brought to child guidance 
clinics and should not include any unusual or bizarre examples. As far 
as possible they should represent a wide range of ages, problems and 
home backgrounds and (d) no foster children should be included; all 
must be living with their own parents. 

The case reports which follow were selected by the editor in coopera- 
tion with the psychiatrists who prepared them. The following psy- 
chiatrists contributed to the study: Frederick M. Allen, Phyllis Blanch- 
ard (two cases), Lydia N. G. Dawes, Hyman S. Lippman, Martha wW. 
MacDonald, H. B. Moyle (two cases), Beata Rank, and Robert A. 
Young. The general plan of presenting the reports was outlined by the 
editor but the actual case material was written by the psychiatrists. 
For each case there is given first a brief paragraph describing the chi 
and his environment, the type of problem presented and the manner 
in which the psychiatrist utilized his relationship with the child in order 
to work out a solution. The editor then elaborates upon this materia 
in order to show her reasons for selecting this particular case for inclu- 
sion in the study. This is followed by the social worker’s report of her 
initial findings after which come the psychiatrist’s reports of the suc- 
cessive interviews. The reports are interlarded with copious footnotes 
in which the psychiatrist gives his reasons for directing the interview 
as he did, and presents his interpretations of the child’s reactions. The 
majority of the case reports are interrupted midway in order to give 4? 
account of the social worker’s progress in working with parents. Each 
report concludes with a brief account of follow-up work over a varying 


period of time. , i i 
In Fart III entitled Comments in Conclusion Dr. Witmer reiterate 
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her earlier warning that this book should not be regarded as a report 
of an experimental study nor should the accounts of the psychiatric” 
interviews be looked upon as models to be followed by rule-of-thumb. 
The cases selected are not representative in the statistical sense of the 
word but only in the popular sense in that the children and their prob- 
lems are similar to those of many other cases brought to child guidance 
clinics. The fact that only cases with reasonably successful outcome 
were chosen obviously means that only a limited part of the usual intake 
of a child guidance clinic would be represented at all. With these limita- 
tions in mind, Witmer then discusses certain points of similarity and 
dissimilarity in the way the various cases were handled, and stresses the 
fact that superficial resemblance does not always mean basic.similarity. 
Differences in treatment arise in part from differences in the concepts 
held by various therapists as to how their relationship to the child can 
best be utilized; in part from differences in the children and in their 
problems. Always, however, the effort must be mutual. The psychia- 
trist does not attempt to solve the child's problems for him; he only helps 
him to find a solution for himself. 

The book is of interest to psychologists for two reasons. In the first 
place the boundary between psychiatry and clinical psychology is far 
from clear-cut. The aims are similar and the procedures overlap. 
Clinical psychologists will therefore find in it many useful suggestions 
and will be stimulated by its argument, whether or not they are in 
agreement with the point of view expressed. Psychologists who are 
not clinicians will also enjoy the editor’s clear and scholarly exposition 
of principles and theories with which all should have some familiarity 
even though many will hesitate to accept them without qualification. 
Although statisticians may cavil at the tacit assumption that the re- 
Ported improvement in the attitudes and behavior of the children which 
ensued upon treatment was due to the treatment and not to other fac- 
tors taking*place simultaneously, the book remains a notable contribu- 
tion within its field since it is the first detailed account of long continued 
therapeutic treatment of children by means of interviews in which the 
Successive steps and the reasons for taking them have been presented 
in an orderly and systematic manner. 

FLORENCE L. GoopENOUGH. 


University of Minnesota. 


Wo rr, WERNER. Personality of the preschool child. New York: Grune & 
Stratton, 1946. Pp. xvi+341. 

This book is, as the author remarks, a result of “the urge to attempt 

an approach to the hidden forces and motivations of child behavior.” 

hroughout the publication the author emphasizes two themes, that 

child behavior is a meaningful and interpretable product of the dynamics 

Of the child’s personality, but that the interpretation must procced from 
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a recognition of the essential differences between the child and the adult. 
Even when the child’s behavior is “sensible” in adult terms, it is danger- 
ous to assume that it is the result of adult-like personality dynamics. 

Dr. Wolff divides the book into three sections. The first is a dis- 
cussion of the characteristics of children based upon observations 0 
free play, supported by references to the findings of other investigators. 
The second section is a report of a number of experimental procedures, 
for the most part interpretation and discussion of the pictures drawn 
by children under experimental conditions. Ina final section the general 
theoretical conclusions are summarized. 

It is not possible in this review even to list the specific experiments 
performed. or to report the details of the author’s theories. He con- 
tinually emphasizes the fact that the child does not display discrete 
behavior which is a product of one aspect of his personality, independent 
of the rest. Where the adult is able to perceive a situation or perform 4 
task relatively uninfluenced by random thoughts and extraneous stimuli 
the child’s perception and behavior is not so differentiated. This char- 
acteristic plus the fact that the child's behavior omits may of the con- 
nections between ideas makes for difficulty in interpretation. 

To some extent the book itself shows the same lack of differentiation: 
It is extremely difficult to describe behavior which is so diffuse and $° 
greatly influenced by numerous factors and still to keep the description 
itself clearly organized. Another characteristic of the book which makes 
it difficult to read is the small amount of space given to some topics: 
The text is most readable and most convincing in the long descriptions 
and discussions of the drawings of an individual child; it is least satis- 
factory when the author tries to summarize his opinions on some com- 
plex topic in a few categorical sentences. 

Though difficult to read, the book is worth the effort; this reader 
came away with the opinion that Dr. Wolff has, by his experimenta 
techniques and demonstration of interpretative methods, niade an im? 
portant contribution to the psychology of the preschool child. 

There is a bibliography of 628 titles and an excellent index. 

ALFRED L. BALDWIN: 

Fels Research Institute. 


RADKE, MARIAN J. The relation of parental authority to children’s be 
havior and attitudes. Univ. of Minnesota, Institute of Child Welfat? 
Monogr. No. 22. Minneapolis: Univ. of Minnesota Press, 194° 
Pp. x +123. 


In recent years psychologists have been attributing increasing 
portance to the cultural milieu as a formative factor in various asp 
of the personality. It has been said that family constellations, met Q 
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of discipline, factors of security or insecurity, and so on, influence a 
person's attitudes and the adequacy of his emotional adjustments. 
Radke, in this book, reviews the literature—clinical, semi-experimental 
and experimental—of the initial attempts to evaluate accurately the 
effect of cultural influences in the area of parental-child relationships, 
and in the use of projective techniques in studying personality. She 
then goes on to report an experiment of her own, on 43 four-year-old 
nursery school children and their parents. She has selected for her study 
one small area of the culture, that of parental authority and discipline, 
and a small fairly homogeneous upper-middle-class population. 

From the review of the literature, the researches in this field are 
shown to be thus far tentative, inconclusive, and in some respects con- 
tradictory. Radke’s experiment is devoted in part to an evaluation of 
the methods used. She employs interviews with both the parents and 
the children, teachers’ ratings, controlled behavior situations, and pro- 
jective techniques (in the form of doll play and responses to pictures). 
The various kinds of data serve to check each other, and to assist in the 
evaluation of each, as well as to indicate the influence of some of the 
cultural influences observed. 

It was found that the children reported accurately on many of the 
objective facts of parental behavior and discipline, but that their inter- 
pretations were limited. 

Several factors interfere with any definitive demonstration of strong 
relationships. For one thing, as Radke points out, relatively small 
differences are to be expected when the population studied is so homo- 
geneous in many aspects of its culture. One important point for con- 
sideration in further research in this field is the repeated finding that the 
children’s understanding and interpretation of the home situations, and 
of specific disciplinary measures, are often quite different from the inter- 
pretations and intentions of the adult experimenters and disciplinarians. 
Care needs to be taken in evaluating the situation from the child’s 
point of view if we wish to understand the effect of his environment upon 
him. Even with these limitations, the data show relations between the 
degree of parental restrictiveness and child behavior. These differences 
are shown to be similar in many respects to those of other experimenters 
using similar classifications of adult authority over children. There are 
other interesting findings on sex differences in attitudes and disciplinary 
measures. 

The experiments reported here are carefully done, with due regard 
for scientific procedures and controls in investigating characteristics 
which until recently have been based largely on clinical experience and 
intuition. They indicate important considerations for future research , 
in this area, and for parent-education in aspects of child discipline. 

Nancy BAXLEY. 


University of California. 
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MAURER, KATHERINE M. Intellectual status at maturity as a criterion for 
selecting items in preschool tests. Univ. of Minnesota, Institute of 
Child Welfare Monogr., No. 21. Minneapolis: Univ. of Minnesota 
Press, 1946. Pp. ix+166. $2.50. 


The problem attacked in this monograph is that of selecting the 
subtests of the Minnesota Preschool Scale (both forms) which are most 
predictive of intelligence at maturity as measured by the Wells revision 
of the Army Alpha. Of 1091 cases on file who had been tested prior to 
age six, 226 were located and tested at ages 16 to 22. A grand total of 
about 700 biserial correlations, between Alpha scores and the Preschool 
Scale items, were computed on 180 cases subdivided into nine age groups 
according to age at time of their preschool test (N’s varied from 9 to 33). 
The records of 46 cases (every fifth) were held for “validating” the 
“predictive” scale made up of the subtests selected on the basis of the 
item analysis. 

Two points lead the reviewer to question the accuracy of the sta- 
tistical analysis. With sizable standard errors (up to .30), it is very 
doubtful that the 400 r’s under .40 (200 under .20) would all be on the 
positive side of zero; yet no negative biserial values are reported. About 
one-tenth of the 700 given standard errors, for biserials, are erroneously 
low. 

On the basis of the item biserial 7’s, 17 of the 29 subtests of each 
form were retained as “predictive.” These 17 represent about 75% ° 
the 224 scoring items per form. The preschool test blanks of the 46 
validating cases were rescored on these selected items, and the correla- 
tions of Alpha with selected tests were compared with the correlations 
yielded by all the preschool subtests. With 7’s based on N's of 3 to 
per age group, the results are inconclusive: it cannot be said that the 
scores based on selected tests are any more predictive than scores base 
on all the tests. This lack of conclusiveness also holds for tne findings 
based on a supplementary validation group with Stanford-Binet IQ 5 
at ages 11 to 15 taken as measures of near terminal intelligence. 

A better procedure would have been to choose on the basis of t 
item analysis the best half of the items as predictive, with the remaining 
half as nonpredictive, then to compare the validity of the predictive 
versus that of the nonpredictive tests, instead of the predictive versus 
the total. The difficulty of eliminating the age factor so as to have @ 
validity coefficient based on all 46 cases could be overcome by rescor ing 
a sufficient number of typical Minnesota Preschool Test blanks sei 
basis for standard scores, at each age level, for the predictive item f 
These standard scores could then be correlated with later Alpha sta? 
ing, also expressed as standard scores by age level. 

This monograph represents an enormous amount of work on a very 
t, and likely fruitful, line of research Far more cases will be 
for checking the reasonable hypothesis that preschool scales 
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contain “dead wood” as far as the prediction of terminal intelligence is 
concerned. 
Quinn McNEMAR. 


Stanford University. 


Forp, Mary. The application of the Rorschach test to young children. 
Univ. of Minnesota, Institute of Child Welfare Monogr., No. 23. 
Minneapolis: Univ. of Minnesota Press, 1946. Pp. xii+114. $2.00. 


Dr. Ford’s study was initiated in 1936; many of its goals and meth- 
ods were set at that time. In the face of the enormous proliferation of 
Rorschach interest, Rorschach studies and Rorschach users (and mis- 
users) in the past decade, what would once have been of pioneering 
significance is now more in the nature of confirmation, elaboration (or, 
occasionally, refutation) of studies published in the interim. 

The first problem Dr. Ford set herself, to refine and simplify Ror- 
schach administration, was not resolved by research, but rather by pre- 
liminary experience. Most notable modifications are the use of a trial 
blot à la Hertz, a prohibition against turning the cards, and acertain 
amount of “prodding” for further responses. Undoubtedly there will be 
disagreement regarding the need for such changes. Although she ap- 
pears to have been unusually successful in carrying through her in- 
quiries, Ford, like others who have worked with young children, sug- 
gests the desirability of a brief inquiry at the time each card is shown. 
The study leaves open the question of the lower age limit for the ad- 
ministration of the Rorschach, inasmuch as no children under three 
were tested. Dr. Ford confirms and further develops Klopfer and 
Margulies’ description, in 1941, of the early “stages” of Rorschach re- 
sponse. : 

Her second problem was the study of “variations in the test de- 
terminants” (scoring categories) and their relation to sex and age. No 
normative value is claimed because the 123 subjects “represent a highly 
selected group both in intelligence and in socio-economic status” (p. 39). 
Existing adult and adolescent norms were used. This section cannot but 
remind the reader of the regrettable lack of final agreement on scoring 
nomenclature. In the few instances where the present study is at vari- 
ance with the Klopfer-Margulies’ findings this appears to have been a 
determining factor. (For example, including among whole responses the 
“cut-off whole,” by Klopfer and Margulies but apparently not by Ford; 
or again, “One possible explanation for the low incidence of C responses 
in... the present study is the fact that Cn responses were not included 
in C responses” (p. 47) as is customary and was the case in the other 
report.) Certain other discrepancies raise the more serious question of 
disagreement on scoring. For example, the author points to the higher 
incidence of M in her study in spite of the fact that “all the investigators 
including the writer, use Rorschach’s instructions in scoring M re- 
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sponses” (p. 54). Yet in the three sample protocols several instances ap- 
pear to the reviewer in which movement was scored even though not 
“human or human-like.” (E.g., “lots of bugs crawling around.’’) The 
reviewer would also quarrel with the scoring of some of the color re- 
sponses. In general, sex differences appear unimportant. However, 
Ford shows that many important scoring categories bear a clear-cut 
relationship to chronological age between 3 and 8. These findings, along 
with others in the literature, point to the need of establishing norms 
perhaps for each year-level under 7 or 8. * 

An attempt was made to validate the relationship between intelli- 
gence and certain specific determinants suggested by Rorschach. Satis- 
factory correlations were found for most of these but not for W and %A- 
These exceptions ‘‘make sense” in view of the fact that children’s whole 
responses are often crude and undifferentiated, unlike the more struc- 
tured W’s of adults, and in view of the fact that animal concepts in 
children’s thinking do not have the banality that they do for adults. 

A third contribution of the study was in establishing test-retest re- 
liabilities for the specific determinants of the Rorschach with young 
children. Most of the coefficients are of the order of +.7. As Dr. For 
points out: 

The evidence of this investigation suggests that with further refinement 
of procedure and scoring, the reliabilities of the various test determinants may 
be expected to approximate those of other standard tests for young children. 
Even at the present time, under the conditions set up in this study, the reli- 
abilities of most of the determinants are fairly satisfactory, especially when 
one considers that the interpretation of test results does not depend upon any 
one determinant but upon the balance and interrelationship between deter- 
minants. 


Her fourth major aim, the study of the “validity of some of the de- 
terminants of personality,” Dr. Ford undertakes with great modesty: 
but it is difficult to evaluate because data so meticulously supplied in 
other sections of the monograph are inadequate here. As one example, 
in evaluating emotional adjustment on the Olson scale we are told that 
the subjects were “divided into three groups on the basis of emotiona 
adjustment as indicated by the Rorschach Test” (p. 82), but we do not 
know how this was done. A new scoring concept of organizational links 
(OL) is tantalizing because it is always referred to allusively. Define 
in the glossary as “The number of verbalized, logical connections within 
a response Or within a record,” this would appear to be a valuable indi- 
cation of complexity of response. However, the sample protocols make 
it difficult to determine whether one counts links or items. “A man 
walking a fence” receives a score of 2 OL; “a partly burned tree trunk 
is also scored 20L; while “a balanced snake” is scored 1 OL. For the 
clinician there 18 Som” disappointment in the mass treatment of data 
and the paucity of individual records or even scoring lists. However, he 
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will find much of interest in the discussion of administration of the 
Rorschach and on the frequencies of locations, determinants and con- 
tent. The student of personality development will discover certain in- 
teresting findings—and rather more interesting suggestions—for the 
use of the Rorschach to study development in general, and the develop- 
ment of the process of logical thinking in particular. 


L. JOSEPH STONE. 
Vassar College. 


KAaBaCK, GoLDIE Ruta. Vocational personalities: an application of the 
Rorschach Group Method. Teachers College, Columbia Univ. Contr. 
to Educ., No. 924. New York: Bureau of Publications, Teachers 
College, Columbia Univ., 1946. Pp. x+116. 


The problem of the relationship between personality and vocational 
choice has been of long standing interest. In the above research an 
aspect of this problem is studied by means of the Group Rorschach 
Method. The vocations investigated are primarily Pharmacy and Ac- 
countancy although comparisons are also introduced with some other 
groups for which published data are available. The author selected 75 
subjects in each of four groups, consisting of practicing pharmacists, 
practicing accountants, and students preparing for the two professions. 
In addition to taking the Group Rorschach Test, each subject answered 
a brief questionnaire pertaining mainly to vocational aspirations, train- 
ing and experience, and to leisure time activities. Practitioners averaged 
37 years in age. Pharmacy students averaged 20 years and accountancy 
students 18 years. 

The bulk of the monograph is devoted to a meticulous statistical 
analysis of inter-group differences based on most of the large variety of 
scorable Rorschach factors that have been recognized to date. The 
statisticalsanalysis is conscientiously performed and well presented but 
the scope of the study does not go much beyond statistically defined 
problems. A limitation is also placed on the data by dealing with the 
Rorschach factors in a purely quantitative fashion. The interpretations 
of the data are thus based only on the sheer number or the proportions 
of various significant items. The omission of qualitative considerations 
can at times be disastrous to an intensive character analysis. 

ə» A number of statistically significant differences are obtained be- 
tween pharmacists and accountants and between the corresponding 
student groups. The accountants have a higher level of intelligence (as 
judged by a number of different items) and a greater degree of caution. 
The pharmacists have a greater degree of anxiety. There is, however, a 
considerable amount of overlap between the distributions of most of 
the Rorschach factors. The author concludes: 


The most notable feature of the study is that both professions attzact people 
of diverse personality characteristics (p. 83). 
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Each profession appears to attract persons with varying degrees of emo- 
tional stability, introversiveness, extratensiveness, intellectual level, etc. (p. 
82). 


A fuller understanding of the relationship between personality and 
vocational choice requires a broader investigation than the one reported 
here. Vocational choice is related to the prestige, economic reward, 
economic security afforded by the vocation as well as to the skills de- 
manded by it. The prestige may vary with the social stratum from 
wkich an individual comes, as may the standards which determine ade- 
quate reward and adequate security. An analysis of what the appeals © 
given occupations are could be the basis of more sharpened hypotheses 
as to which personality factors to investigate. An attempt to obtain 
such information, in the present study, through a questionnaire was not 
very successful. When asked the reason for their choice of profession, 
over 60 percent of the 300 subjects either did not answer this question, 
or said they selected the profession because they liked it or were inter- 
ested in it. Intensive interviews of a few subjects might have been mor@ 
fruitful, perhaps yielding a picture of the relationship between sel 
evaluation and occupational choice. Such findings as the greater anxiety 
among pharmacists could be clarified if there were greater understand- 
ing of the meaning of the profession as a way of life to the people in it. 

The discovery of a high degree of overlap between the pharmacists 
and the accountants, despite the wide differences in the nature of these 
professions, focusses attention on the limitations of a personality aP- 
proach to vocational choice in present day society. 

As the author indicates, the present study cannot be taken as a final 
statement. She points to the need of a more diverse sample than was 
used and to the need for further experimentation with the Group Ror- 
schach Method to clarify interpretations which were carried over bodily 
from the individual test. We would stress the value in future researc? 
of an intensive approach which would not limit personality description 
to the special characteristics of a single test and which would investigate 
motivational and social determinants as well as personality structure 

Max HERTZMAN. 


College of the City of New York. 


LAWTON, GEORGE. Aging successfully. New York: Columbia Univ. 

Press, 1946. Pp. xiv+266. 

DE GRUCHY, CLARE. Creative old age. San Francisco: Old Age Cou 

ling Center, 1946. Pp. 143. 

The normal “problem group” in our society is the later-maturity °" 
old-age group; and this group is increasing at a rate which compels at- 
tention, if not alarm. Thus, “in 1950 the age group forty-five and a 
will make up 30 percent of the population” (Lawton, p. 257). By 1980, 
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the number of persons over 45 will have risen to about 40 percent of the 
total population.* If psychology were more alert, the science of geron- 
chology—if one may coin guch a term for the psychology of aging— 
would occupy a spot much closer the center of the psychological stage 
than it now does. 

Both the books by Lawton and de Gruchy are “popular,” in the 
sense that they are well-written, and aim to reach an audience that is 
more interested in practical application than in scientific proof. The 
book by de Gruchy is divided into two parts. The first part presents 
case-histories of seven individuals counselled by the San Francisco Old 
Age Counselling Center (founded by Dr. Lillien J. Martin); the second 
part describes principally the “SOA Farm” and the development of 
various avocations by a number of counselees. As R. G. Kuhlen states 
in his excellent Foreword to the book, the keynote of the San Francisco 
Counselling Center is the stimulation of goal-directed activity on the 
part of old people. Neither de Gruchy nor Lawton has much faith in 
the adequacy of merely recreational or “chronicidal” (time-killing) ac- 
tivity. “What is needed instead is the self-fulfillment of the individual, 
through securing for him an opportunity for growth in his preferential 
field of interest” (de Gruchy, p. 110). De Gruchy cites a striking ex- 
ample of the inadequacy of the merely recreational approach: 


In Long Beach, California, perhaps the town most fully populated by old 
men and women, the municipality has recognized some needs for the old. 
Many forms of suitable play have been established. Bowling and croquet 
lawns abound. There are open-air shelters for card playing, billiard tables, and 
the like. But watch the people who use them. They are killing time, perhaps 
even mildly amusing themselves; but they are as dour-looking, as slowed-down, 
as limp and lifeless as any other old age group found in less privileged locations 
(p. 88). 


Lawton’s book is more comprehensive than de Gruchy’s, aiming to 
provide, in an informal or conversational manner, guidance “on the 
most effective way of making the typical adjustments called for by later 
maturity” (p. 236). The word “typical” in this quotation jars the 
present reviewer, because it seems to imply some lack of recognition of 
individual differences. Both Lawton and de Gruchy give their enthusi- 
astic, general approval to an “activity” program: namely, the develop- 
ment of personally significant goals, purposeful activity, and future- 
mindedness (this advice to senescents sounds strangely like common 
advice to adolescents). Very likely such a prescription fits the needs, of 
most counsellors’ clients. But what about the underweight, cardiac 
neurasthenic who might benefit from more rest and less purpose; or the 


* DuBLIN, L. I. Longevity in retrospect and in prospect. In E. V. Cowdry (Ed.), 
Problems of Ageing. Baltimore: W illiams and Wilkins, 1942. “ 
+ The letters SOA stand for “Salvaging Old Age. 
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over-active oldster with plenty of purpose, who fails to recognize his 
own limitations and gets into everyone’s hair in the process? “Calm 
down” is occasionally as good advice as “get up!” Purposeful activity 
may well be essential to self-respect and social approval; nevertheless, 
the realistic adaptation to old age also demands the development of a 
certain degree of relaxation, of satisfied tolerance for idleness and di- 
minished functioning. The reviewer feels that Lawton’s and de Gruchy’s 
energetic activity-prescription must be applied with discrimination— 
most likely the very kind of discrimination which these consultants apply 
in their practice, but which they have failed to reflect in their books. 

Both Lawton and de Gruchy write with infectious enthusiasm and 
zeal. Both feel sure that their methods are highly successful. For the 
San Francisco Old Age Counselling Center, de Gruchy cites an actua 
figure: 9 percent failure, 91 percent success. The reviewer is surprise 
that de Gruchy does not present separate figures for men and for women; 
since the adjustment problems of most men (who pass from a paid oc- 
cupation to unflattering and unremunerative retirement) seem obvi- 
ously greater than the problems of most women (who simply continue 
as housewives). In any event, the figure of 91 percent success is very 
impressive; and as Kuhlen points out in his Foreword to de Gruchy $ 
book, ‘‘The danger of interpreting a transitory ‘pepping-up’ as a S!8" 
nificant change in personality and purpose has been avoided by refusa 
to evaluate success until at least two years have elapsed” (p. ii). Never- 
theless, there are still grounds for skepticism. Thus, the reviewer doubts 
that either de Gruchy’s or Lawton’s samples represent a cross-section 
of the aged population with respect to health, financial resources, g¢™ 
eral level of ability, special abilities, and general level of social skills- 
Furthermore, as Kuhlen points out, the success of old-age counsellors 
may be “partially explained by the probability that the old person W20 
bestirs himself sufficiently to seek aid and voluntarily to urdertake @ 
program of rehabilitation already has a latent motivation that can g 
capitalized” (p. ii). A counter-hypothesis is that only those with the 
more serious problems are likely to seek a counsellor; but the reviewe!s 
for one, would give Kuhlen’s suggestion the greater weight. In the case 
of Lawton’s clients, the reviewer gained the impression that these were 
typically in the younger old-age group (below 65, rather than above)— 
which again seems a factor favorable to successful adjustment oF Pen 
habilitation. 

Lawton’s chapter, “In youth prepare for age,’ 
deserves far greater emphasis than psychologists or ed 
given it. ‘Adult education seems limited in this respect, because— 


bout work that in the feverish 
lay. They 


tition, 


’ makes a point which 
ucators hav 


Most men and women get so concerned a 
desire to rise in the world they throw out the ballast of rest and p 
become creatures of one function, struggling unceasingly to meet compe 
and money becomes 4 goal for living rather than a means to it (p. 102). 
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Most self-help books are guilty of the fallacy of voluntarism. While 
there are undertones or overtones of voluntarism in both Lawton and de 
Gruchy, these authors should be complimented for preserving, for the 
most part, a sensible means-end approach. On the other hand, both 
authors seem over-enthusiastic or over-optimistic regarding the possi- 
bility of surmounting the problems and handicaps of old age. Enthusi- 
asm and optimism, however, are part of the therapeutic process, and 
(if not overdone) are probably justifiable in popular works. 

The literary quality of the two books calls for some special recog- 
nition. Especially in Lawton’s volume, there are some passages which 
remind one of William James. Three examples must suffice: 


...the life which lasts longer and is progressively emptier—this is a pe- 
culiarly modern form of torture. 

Caution and comfort become rallying cries as the old man races off to do 
battle without leaving his easy chair. 

Devotion to “remembrance of things past” is for writers legal tender of 
high purchasing value. But for an older person, it represents inflated currency; 


‘it buys nothing and if he tries to live on it he will starve. Only an active creative 


present has true purchasing power for an older person. 


Finally, it may be pointed out that both Lawton’s and de Gruchy’s 
books adopt the individualistic or psychological, rather than the socio- 
logical, approach. Both are books of counsel, not social reform. Lawton 
seems to recognize—but inadequately, we think—that the economical 
and practical solution of many of the problems of our “‘senior citizens” 
requires organized, governmental action. For de Gruchy, society and 
government apparently exist only vaguely in the background. Such a 
highly individualistic outlook runs counter to the view (held by virtu- 
ally all serious students in this field) that “not all of the answers to the 
problems of gerontology are to be found within the individual.’’* 

` H. S. CONRAD. 


College Enirance Examination Board. 


MICHOTTE, A. La perception de la causalité. Études de Psychologie, Vol. 
VI. Paris: Vrin, 1946. Pp. viii +296. 


The investigation of complex perceptual processes progresses very 
slowly. Rubin’s book on figure and ground and Katz's book on the 
phenomenal appearance of colors presented two important steps for- 
ward. Now there appears a book on the perception of causality by the 
Belgian psychologist Michotte which, like these older books, seems to 
open up a new field. The book is largely concerned with the answer to 
the question of what the stimulus configurations are which arouse in the 


* OVERHOLSER, W. Orientation. In Mental Health in Later Maturity. Supplement 
No. 168 to the Public Health Reports. Washington, D. C.: Government Printing Office, 
1943, 
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observer the impression that one thing is pushing or pulling another. 

A very ingenious method is used for producing the variation in the 
stimulus patterns. Lines of different shapes dre drawn on a disk turning 
behind a screen with a horizontal slit. A ring on the disk produces a 
stationary spot, a spiral produces a moving spot. In one of the basic 
experiments, a spot A appears in the middle of the slit. Another spot, B, 
appears from the left and moves toward the right till it touches A which 
has so far remained stationary; then B stands still and A moves to the 
right. This arrangement produces the phenomenon of B pushing A, the 
motion of A seems to be caused by the motion of B. The set-up allows 
changes in different variables: the time interval between the contact of 
A and B and the start of the motion of A, the speeds of A and of B 
and their relation to each other, etc. 

On the basis of many experiments Michotte concludes that the phe- 
nomenon “‘lancement” (pushing by impact), to which one third of the 
book is devoted, is a unified perceptual form like a shape and that it 
arises under specific stimulus conditions. In the same way the phe- 
nomenon “entrainement” (pushing during contact) is investigated. 
Michotte adds a further theoretical analysis which can only be indicated 
in a brief review. He states that the movement of the agent seems tO 
extend over the passive body, and that a unit formation between active 
and passive body plays an important role. Besides these fundamental 
phenomena the perception of the causation of qualitative changes an 
of animal locomotion are discussed, and the bearing of these findings 0 
the interpretation of different philosophical theories of causality is con- 
sidered. 

The book contains a wealth of stimulating experimentation and 
theory in a practically virgin field. The only question that arises in the 
reviewer's mind is whether some results are not conditioned by the 
limited possibilities of patterns which the method of the disk: allows. 

Fritz HEIDER-. 

University of Kansas. 


DELGADO, Honorio. La personalidad y el carécter. (2nd Ed.) Lima: 
Editorial Lumen, 1946. Pp. 207. 


The author of this brief but comprehensive précis of personality and 
character has every reason to be gratified over the need for a secon 
edition. Very likely his own influence as a distinguished psychologist 
and as editor of an important psychiatric journal had much to do wit 
producing a continued demand for the book, which deserves to become 
as widely known in North America as it evidently is in South Americ 
‘Added to the first idea of acquainting readers with the fruits of Dilthey ê 
tradition in the field of personality, is the hope expressed in the new 
preface that irresponsible forms of popular characterology may be over” 
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come through the kind of balanced scientific treatment that Delgado 
offers. 

Apart from the general theory of personality, covering structure, 
genesis, and dynamics, the plan of the book deals at length with some 
six typologies and with characterology as such. The limitations of all 
typological approaches are stressed, and there is a regular insistence on 
the cohesive nature of personality as well as a consistent demand for 
what Allport calls the idiographic approach. The typologies presented 
include temperament theory, and the schemes of Kretschmer, Jaensch, 
Pfahler, Jung, and Spranger. Such specialized studies as that of twins 
reared apart are reviewed in order to evaluate environmental factors. 
Freud’s contributions are not regarded as foundational. 

In the United States, books on personality frequently give short 
shrift to the topic of character. On the contrary, Delgado considers 
character at virtually every point in his discussion of personality, pre- 
senting the one as what amounts to an aspect of the other. To him 
character “‘in the strict sense, is personality manifested.” The entire 
general theory of personality and character, as the reader is notified in 
the preface, takes its departure from a conception of spiritual integra- 
tion rather than from biological considerations, and is frankly teleologi- 
cal in its implications. The fact that this will not interest certain stu- 
dents of personality in this country is perhaps a good reason in the long 
run for their becoming acquainted with Delgado’s approach, and for 
observing its amplified scope as compared with narrowly operational 
investigations. 

The changes between editions are all minor in nature, involving in- 
cidental corrections and a few small additions. The book contains well- 
arranged indexes and a useful bibliography. Its format is the tasteful 
and readable typography of the first edition. 

fe Howarp Davis Sporrt. 
American International College. 


Ling, Henry C. The rediscovery of morals, with special reference to race 
and class conflict. New York: Dutton, 1947, Pp. 223. 


The thesis of Max Weber was that a symbiotic relationship existed 
between Protestantism and Capitalism, so that they thrived together. 
Dubious as Weber’s hypothesis may have been, if confirmatory illustra- 
tion were wanted it could be found in this book by Dr. Link. The sub- 
stance of the moral theory presented is that the freedom and dignity of 
man rest upon the institution of private property—or, at least, that 
threats to private property are the greatest threats to freedom. 

The two moral sores festering in modern life are class conflict and 
race conflict. These are more or less equated as parallel problems, be- 
cause both arise out of failure to respect the worth of the individual, and 
in both cases the cause of conflict is falsely attributed to economic causes 
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instead of moral ones. A generally wholesome and humane position on 
the race issue is used alongside a violently anti-union and anti-socializa- 
tion position on the economic issue. On the‘one hand, the right of the 
Negro to a job parallels the right of the employer to make all the money 
he can. The moral law blesses both. On the other hand, the danger of a 
Fair Employment Practice Committee is that it might increase racial 
tension as the Wagner National Labor Relations Act has increased 


class strife. 

Despite a certain show of evidence in references to a 72-item bibliog- 
raphy, including the results of public opinion polls and tests of per- 
sonality quotients, the book is essentially an essay in defense of private 
enterprise, using the sanctions of decency, morals, and religion to bolster 
an argument. The tactics are those of the debater and propagandist, 
not those of the scientist, philosopher, or theologian. 

Ernest R. HILGARD. 


Stanford University. 
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